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NOVEL EXPRESSED GENES 



This ^Ucation claims the braiefit of U.S. Serial No. 10/1 12,372, ffled April 1, 2002, 
U.S. Serial No. 60/382. 614, filed May 24, 2002, U.S. SaialNo. 10/164,717, filed June 10, 
2002, U.S. Serial No. 10/167,631, filed June 13, 2002, U.S. Serial No. 10/177,917, filed June 
24, 2002, and U.S. Serial No. 60/399,125, filed 30 July 2002, which are hereby incorporated 
by reference in their raitirety. 

DESCRIPTION OF THE DRAWINGS 

Fig. 1 shows the expr^sion of OTB0949 in human tissues. PGR was performed 
using SEQ NO 3 as the forward primer, and SEQ ID NO 4 as the reverse primer. 

Fig. 2 shows the expression pattern of human OTB182, an integral membrane protein, 
in human tissues. To detect gene expression, PGR was carried out on aliquots of the 
normalized tissue samples using a forward (SEQ ID NO 13) and reverse (SEQ ID NO 14) 
gene-specific primers. 

Fig. 3 shows the amino add alignment of human OTB182 (SEQ ID NO 12) with 

mouse AK003645 (SEQ ID 15). 

Fig. 4 (A-D) shows the amino acid alignments of a human transient receptor potential 
cation channel (TRPCQ with related channel family members. Human sequences are TRPM 
(SEQ ID NO 17), human AB046836 (SEQ ID NO 19), human XM;_036123 (SEQ ID NO 
18), and mouse XM_140575 (SEQ ID 20). 

Fig. 5 shows tiie expression pattern of human TRPCC in human tissues. To detect 
gene expression, PGR was carried out on aliquots of the normalized tissue samples using a 
forward (SEQ ID NO 21) and reverse (SEQ ID NO 22) gene-spedfic primers. 

Fig. 6 shows the amino acid sequence alignments between difTerent forms of the 
human melanocortin-1 receptor. NM_002386 or MCR-1 A (SEQ ED NO 30). MGR-IC 
(SEQ ID NO 26). MCR-IB (SEQ IDNO 31). 

Fig. 7 shows a sdiematic of the exon sizes for the melanocortin-1 gene and the 
tubulin gene (exon 7). 

Fig. 8 shows the expression pattern of OTB860 in human tissues. SEQ ID NOS 40 

and 41 are the primer sequences. 

Fig. 9 (A-G) shows the amino acid aUgimients of OTB860 (SEQ ID NO 39) and 
KIAA1678 (SEQ ID NO 42). 
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Fig. 10 (A and B) is the amino acid alignments of the different splice variants of 
human TARPP, Brl37A (SEQ ID NO 46), B (SEQ ID NO 48), C (SEQ ID NO 50), D (SEQ 
ID NO 52; SEQ ID NO 14, NM_016300), and E (SEQ ID NO 44), and partial clone 
AL133109 (SEQ ID NO 55). 

Fig. 1 1 is a schematic drawing showing the differences between the various forms of 

human TARPP, 

Fig. 12 (A-C) shows amino acid ahgmnents of the different splice variants of human 
TARPP (Brl37A, B, C, D, and E) with mouse TARPP (NM_033264; SEQ ID NO 53). 

The foUowmg procedure was used for the expression profile. A twenty-four tissue 
panel was used (lanes from left to right): 1, adrenal gland; 2, bone marrow; 3, brain; 4, 
coloI^ 5, heart; 6, intestine; 7, pancreas; 8, liver; 9, lung; 10, lymph node; 1 1, lymphocytes; 
12, mammary gland; 13, muscle; 14, ovary; 15, pancreas; 16, pituitary; 17, prostate; 18, skin; 
19, spleen; 20, stomach; 21, testis; 22, thymus; 23, thyroid; 24, uterus. The lane at the fer left 
of each panel contains molecular weight standards. The results were obtained according to 
the following procedures: 

Polyadenylated mRNA was isolated from tissue samples, and used as a template for 
first-strand cDNA synthesis. The resulting cDNA samples were normalized using beta-actin 
as a standard. For the normalization procedure, PGR was performed on aliquots of the first- 
strand cDNA using beta-actin specific primers. The PGR products were visualized on an 
ethidium bromide stained agarose gel to estimate the quantity of beta-actm cDNA present in 
each sample. Based on these estimates, each sample was diluted with buffer until each 
contained the same quantity of beta-actin cDNA par unit volume. 

To detect gene expression, PGR was carried out on aliquots of the normalized tissue 
samples using a forward and reverse gene-specific primers. The reaction products were 
loaded on to an agarose (e.g., 1.5-2%) gel and separated electrophoretically. 

DESCRIPTION OF THE INVENTION 
The present invention relates to all &cets of the novel genes described harein, 
polypeptides encoded by fliem, antibodies and specific binding partners fliereto, and their 
applications to research, diagnosis, drug discovery, therapy, clinical medicine, forensic 
science and medicine, etc. The polynucleotides and polyp^tides are usefiil in variety of 
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ways, including, but not limited to, as molecular markers, as drug targets, and for detecting. 



invention. The identification of specific genes, and groups of genes, expressed in pathways 
5 physiologically relevant to particular tissues, pennits the definition of fimctional and disease 
pathways, aud the delineation of targets in these pathways which are usefiil in diagnostic, 
therapeutic, and clinical applications. The present invention also relates to methods of using 
the polynucleotides and related products (proteins, antibodies, etc.) in business and 
computer-related methods, e.g., adv^tising, displaying, offering, selling, etc., such products 
1 0 for sale, commercial use, licensing, etc. 



OTB0949 is a polynucleotide (SEQ ID NO 1-2) which is expressed predominantly in 
brain tissue. Low levels of expression are observed in oth^ tissues, e.g., adrenal g^and, 

15 mammary gland, pituitary, stomach, and testes, but brain expression is at least 100-fold 
higher. See, e.g., Fig. 1 . Because of its selectivity for brain, OTB0949 can be used as a 
molecular marker for brain tissue, e.g., in pathology and cytology, as well as a target, e.g., to 
ablate brain tissue, to deliver drugs to brain cells, etc. In the brain, OTB0949 is highly 
expressed in amygdala, hippocampus, thalamus, and retina. OTB0949 can also be a useful in 

20 diagnostics and therapeutics to treat neurological and visual disorders. 

The brain is one of the most compUcated and least understood organs in the 
mammaUan body. Anatomically, it is composed of four different regions: (1) cerebral 
hemispheres, (2) diencephalon (thalamus, hypothalamus, and qpithahnus), (3) brain stem 
(midbrain, pons, and medulla oblongata), and (4) cerebellum. These can be further 

25 subdivided. For instance, the cerebral hemispheres contain cerebral cortex and basal ganglia 
(caudate nucleus, putamen, globus pallidus, lentifonn nucleus, corpus striatum, amygdala). 
The midbrain contains, e.g., cerebral peduncles, corpora quadrigemina, colliculi, substantia 
nigra, and the red nucleus. Other regions and subdivisions of interest include hypothalamus, 
pituitary, cranial nerves, pineal, gray matter, white matter, raphe nucleus, limbic system, etc. 

30 Various cell types are found in the brain, including supporting cells, such as neuroglia, glia, 
astrocytes, microglia, ependymal cells, oligodendrocytes, and Schwaim cells, neurons, such 



diagnosing, staging, monitoring, prognosticating, prevCTting or treating, detCTnining 
predisposition to, etc., diseases and conditions, associated with genes of the present 



OTB0949 
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as multipolar, bipolar, unipolar, Purkinje, and pyramidal cells. 

The gene is located at chromosomal position 12q24^. Several neurological diseases 
have been mapped to this region, e.g., spinocerebellar ataxia 2 (associated with a mutation in 
the ataxin-2 gene), spinal muscular atrophy (OMIM 158590), and amyotrophy (OMIM 

5 181405). IMsruption of OTB0949 (e.g., in the conresponding gene a transgenic animal) ca^ 
result in a brain disorder or susceptibility thereto, including those mentioned above. 

OTB0949 is coded for in a single exon, and comprises a short coding sequence of 
about 135 amino acids (SEQ ID NO 2), It contains a stretch of hydrophobic amino acids 
from about positions 76-100 (SEQ ID NO 2). Examples of promoters include, e.g., SEQ ID 

10 NOS 5-10 located in contig NT_009775 at about 557-607, 1263-1313, 1591-1641, 1635- 
1685, 1714-1764, and 1936-1986, respectively. 

The present invention also relates to polypeptides of OTB0949, e.g., an isolated 
human OTB0949 polypeptide comprising or having the amino acid sequence set forth in SEQ 
ID NO 2, an isolated human OTB0949 polypeptide comprising an amino add sequence 

1 5 having 80, 85, 90, 95, 97, 99% or more amino acid sequence identity to the amino acid 
sequence set forth in SEQ ID NO 1, and optionally having one or more of OTB0949 
activities, such as cell signaling activity, secretory pathway activity, etc. Fragments specific 
to OTB0949 can also used, e.g., to produce antibodies or other immune responses, as 
competitors to any of its activities, etc. These fiagments can be referred to as being "specific 

20 for*' OTB0949. The latter phrase, as aheady defined, indicates that the peptides are 

characteristic of OTB0949, and that the defined sequences are substantially absent firom all 
other protein types. Such polypeptides can be of any size which is necessary to confer 
specificity, e.g., 5, 8, 10, 12, 15, 20, etc. 

25 OTB182 

OTB182 is an integral membrane protein comprising 307 amino acids (SEQ ID NO 
12). Using SMART (e.g., Schultz et al., Proc. Natl Acad. Scu, 95:5857-5864, 1998; Letunic 
et al.. Nucleic AcidRes.^ 30:242-244, 2002), the protein is predicted to have seven membrane 
spanning regions at about ammo adds positions 45-67, 87-109, 129-151, 161-183, 196-218, 
30 231-253, and 278-297. There is a putative signal sequence (at amino adds l-26)atitsN- 
terminus which also overlaps with an dght transmembrane spanning domain (at amino adds 
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10-32). 

OTB182 is expressed piedominantly in excitable tissues, e.g., btain, heart, and 
muscle, wiflivray low expression observed in prostate tissues. See. Fig. 2. In tiie brain, it is 
e3q)ressed predominantly in tiialamus. It also expressed in neural stem cells. Its egression 

5 in excitable tissues makes 013182 a highly selective marker for excitable tissues, as well as 
indicating a functional and/or developmental role in this tissue type. There are a number of 
genes whose expression is restricted predominantly to excitable cells, e.g., GABA-interacting 
fector-l (GRIF-1; Beck et al., J. Biol. Chem., 28 May 2002); calcium cahnodulin dependent 
(CaM) kinase (e.g., Losefli et al.. Brain Research, 869(1 -2): 137-145, 2000); sodium channel 

10 types (e.g., Schaller et al., J. Nettrosci, 12(4): 1370-81, 1992); calcium dependent 

mitochondrial solute carrier (e.g., Del Arco et al., J. Biol Chem., 273(36):23327-23334, 
1998). 

A mouse homolog of OTB182 is AK003645 (SEQ ID NO 15) that codes for a 153 
amino acid polypeptide. It shares about 92% sequence identity fiom amino adds 1-122, and 

15 sharply diverges from that point onward. See. Fig. 3. Murine OTB182 maps to 

chromosomal location 1 1E2. The present invention relates all transcripts associated wifli the 
AK003645 gene loci. The degree of nucleotide sequence identity between human and mouse 
(AK003645) is about 83% from about nucleotide position 71-437 of SEQ ID NO 12. 
OrB182 is located at chromosomal band 17q25. A genetically-inherited 

20 neuromuscular disease, hereditary neuralgic amyotrophy (HNA) has been mapped to this 
locus. See, e.g.,Jeamietetal.,J\feMro/ogj;, 57:1963-1968, 2001. hi addition, a hereditary 
hearing loss maps to 17q25 (e.g., DFNA20, Mordl et al.. Genomics, 63:1-6, 2000) and 
mental retardation (Rio et al.. Human Genetics, 108:51 1-515, 2001). Examples of specific 
polynucleotides are SEQ ID NOS 13 and 14. 

25 Diseases or disorders which can be treated in accordance with the present mvention 

mclude, but are not limited to neuropathy, neuralgic amyotrophy (e.g., HNA), myopathy, 
sensorineural hearing loss (e.g., DFNA20), mental retardation, neuromuscular disorders, 
brain cancer, such as a neuioblastoma, and other diseases and conditions involving heart, 
brain, and muscle tissues, 

30 A transonic animal with 018182 functionally disrupted can show a defect in an 

excitable cell. Such defect, includes, e.g., developmental defects, defects m the functional 
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activity of the cell, e.g., in excitability, membrane conductance, response to stimuli, signal 
transduction, or any of the disorders mentioned herein. 

Antibodies to OTBl 82 can also be produced, e.g., an antibody which is specific-for: 
an epitope selected from amino acids 123-307 of SEQ ID NO 12, or comprising amino acid 
5 27, 47, 64, 66, 75, 78, 105, 1 1 1, or 1 13 of SEQ ID NO 12 

Human transient receptor potential cation channel (TRPCC) gene and polypeptide 



is selectively expressed in brain, kidney, and pituitary, with very low e^qiression observed in 

1 0 testis and ovary. By the phrase "selectively expressed," it is meant tiiat a nucleic acid 

molecule, when produced as a transcript, is characteristic of the tissue or cell-type in which it 
is made. This can mean that the transcript is expressed only in that tissue and in no other 
tissue-type, or it can mean that the transcript is expressed preferentially, differentially, 
predominantly, and more abundantly (e.g., at least 5-fold, 10-fold, etc., or more) in that tissue 

1 5 when compared to other tissue-types. 

The nucleotide and amino acid sequences of human TRPCC are shown in SEQ ID 
NOS 16 and 17. Analysis of its primary stmcture indicates the presence of six 
transmembrane domains at about amino acids 870-892, 901-1 1 12, 904-921, 936-958, 971- 
990, 1005-1024, 1085-1107 of SEQ ID NO 17,howevear,by analogy to other ion channels, it 

20 is generally believed to have only six transmembrane spanning regions. See, e,g., Clapham et 
al.. Nature Reviews, Neuroscience, 2:387, 2001. The ion transport domain comprises amino 
adds 901-1 1 12. There is also a putative transmembrane domain at the N-terininus at about 
amino acids 5-24. According to the six-transmembrane model, both the N- and C-terminus 
of the protein are intracellular, and provide a scaffolding for interaction with other proteios. 

25 The human TRPCC contains 25 exons. The present invention relates to any isolated 

introns and exons that are present in the gene. Intron and exon boundaries can be routinely 
determined, e.g., using the sequences disclosed herein. 

Partial sequences for human TRPCC were previously identified (e.g.. Accession 
Numbers AB046836 and XM_036123). For example, human AB046836 (SEQ ID 19) is 

30 incomplete, coding for 1017 amino acids (See Fig. 4, AB046836), and lacks the first 690 
amino adds of human TRPCC, but shares about 99% identity with TRPCC along the rest of 



Hvmian TRPCC codes for a polypeptide of 1707 amino adds. As shown in Fig. 5, it 
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its lengflL Another partial sequence, human XM_036123 (SEQ ID NO 1 8) codes for 988 
amino adds (See Fig 4, XM_036123), laddng the first 719 amino adds of human TRPCC, 
but shares 100% identity with TOPCC along the rest of its length (See Fig 4). XM_140575 
(SEQ ID NO 20) appears to be a homolog of human TRPCC, and shares about 94% sequence 
5 identity from about amino adds 82-693, or about amino adds 345-956 of human TRPCC 
(SEQ ID NO 17). Amino adds 1-81 and 694-736 (see Fig. 4) of the mouse homolog have 
low sequence identity with human TRPCC. Aligmnent with mouse genomic DNA using 
Spidey (NCBI) indicates that amino adds 1-80 of XM_1 40575 are derived from exons 1 and 
2 of the genomic DNA, and amino adds 694-736 are derived from exon 7 of the mouse 

10 genomic DNA. XM_140575 is located on mouse chromosome 19B. 

TRPCC maps to chromosomal region 9q21 • 1 . Strikingly, hypomagnesemia with 
hypocalcemia (OMIM 602014) are known to be detemiined by a miitation witMn 9q21 
(Walder et al.. Human Molecular Genetics, 6: 1491-1497, 1997), as would be expected with 
a chaimel responsible for cation conductance. Consistra^t with its expression in brain, a 

1 5 susceptibility to amyotrophic lateral sclerosis with frontotemporal dementia (OMIM 1 05550) 
was mapped to this same chromosomal locus (Pinsky et al., Clinical Genetics, 7:186-191, 
1975; Hosier et al., JAMA, 284:1664-1669, 2000). In addition, schizophrenia (Hovatta et al.. 
Am. J. Hum. Genet, 65: 1 1 14-24), and familial dyskinesia/fadal myokymia (Fernandez et al., 
Ann. Neurol, 49:486-92, 2001) are also assodated with this gene locus. Nucleic adds of the 

20 inresent invention can be used, e.g., as linkage markers, diagnostic targets, and ttierapeutic 
targets for any of the motioned disorders, as well as any disorders or genes mapping in 
proximity of TRPCC. 

TRCC polynucleotides, polypeptides, ligands, and binding partners thereto, can be 
used in a number of useftil ways. For example, binding partners, such as antibodies and 

25 ligands, can be used to selectively target agents to brain, kidney, and other tissues in which it 
is expressed for purposes including, but not limited to, imaging, diagnostic, therapeutics, etc. 
Tmaging of tissues Can be fadlitated using agents such as TRPCC antibodies that can be used 
to target contrast agents to a specific site in the body. Various imaging techniques have been 
used in this context, including, e.g.. X-ray, CT, CAT, MRI, ultrasound, PET, SPECT, and 

30 sdntographic. A reporter agent can be conjugated or assodated routinely with a TRPCC 

antibody. Ultrasound contrast agents combined with ligands such as antibodies are described 



wo 03/085095 PCT/US03/09921 

-8- 

in, e.g., U.S. Pat Nos 6^64,917; 6,254,852; 6,245,318; and 6,139,819. MRI contrast agents, 
such as metal chelators, radionucleotides, paramagnetic ions, etc., combined with selective 
targeting agents are also described in the literature, e.g., in U.S. Pat Nos. 6,280,706 and 
6,221,334. The methods described Ih^ein can be used generally to associate TRPCC and 

5 ligands thereof with an ag^t for any desired purpose. 

An active ag^t can be associated in any manner with an TRPCC ligand that is 
efifective to achieve its delivery to the target The association of the active agent and the 
ligand ("couphng") can be direct, e.g., through chemical bonds between the binding ligand 
and the agent or via a linking agent, or the association can be less direct, e.g., where the 

10 active agent is in a liposome, or other carrier, and tihie ligand is associated with the liposome 
surface. In such case, the Ugand can be oriented in such a way that it is able to bmd to 
TRPCC on the sur&ces of kidney or brain cells. 

Usefijl human TRPCC polypeptides and corresponding nucleic adds include 
polypeptides comprising amino adds 1-88, 5-24, 1-690, 1-719, and fragments thereof (See 

15 SEQ ID NO 17 and Fig. 4). Nucleic acids and polypeptides can be used as probes (e.g., in 
PCR, in Northem blots, etc.), as diagnostic agents, to generate antibodies, as vaccines, to 
produce recombmant proteins, as antisense, etc. A spedfic polynucleotide according to the 
present invention can be determined routinely. Examples are specific probes are SEQ ID 
NOS 21-24, e.g., where SEQ ID NOS 23 and 24 can be used as forward and reverse PCR 

20 primers, respectively, to amplify a portion of amino add region 1-1 60 of SEQ ID NO 17. 

TRPCC has a number of biological activities, including, e.g., cation transport, signal 
transduction, protein binding, etc. By "signal transduction" is meant the activation of a 
chain of events that alters the concentration of one or more small intracellular signaling 
molecules (second messengers), e.g., cyclic AMP, caldum ions, as described in Sabala et al., 

25 British Journal of Pharmacology, 132:393-402, 2001. By "cation transport' is meant the 
influx or efQux of a cation, e.g., caldum, magnesium, into or from a cell. Mizuno et al.. 
Molecular Brain Research^ 64:41-51, 1999. Protein binding indicates the abiUty of the 
protein to interact with other protems, e.g., as the N-terminus interacts with intracellular 
protems. These activities can be determined routinely. Signal transduction can be assessed 

30 by expression of TRPCC in cells, etc., and measurement of the concentrations of elidted 

second messengers or byproducts, e.g., Ca^ or Mg^ or cAMP, inositol, etc., by, e.g., atomic 
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absorption specbometiy (ThennoEleinaital SOLAAR AA spectromet^), 
radioimmunoassay, etc. Sano et al. Science, 293:1327-1330, 2001. Cation transport can be 
assessed by measurement of changes in ionic currents by whole-cell patoh-clamp analysis. 
For instance, cells or oocytes can be transfected with a polynucleotide of &e present 
5 invention and then analyzed for egression of calcium channel activity, e.g., using patch 
clamp, calcium activated dyes, etc.. See, also, e.g., Strubing et al.. Neuron, 29:645-655, 
2001; Sano et al.. Science, 293:1327, 2001; Ohki et al., J. Biol. Chem., 275:39055-39060, 
2000; Boulay et al., J. Biol. Chem., 272:29672-29680, 1997. 

f 

The present invention relates to an isolated polynucleotide comprising, e.g., a 
10 polynucleotide sequence coding wifliout interruption for a human TRPCC polypq)tide, or 
complement thereto, said TEIPCC having 80%, 85%, 90%, 92%, 95%, 99%, or more amino 
add sequence identity along its entire length to flie sequence comprising amino adds 1-690 
of SEQ ID NO 17, and 80%, 85%, 90%, 92%, 95%, 99%, or more amino add sequence 
identity along its entire length to the sequ^ce comprising fix)m amino adds 691-1707 of 
15 SEQ ID NO 17, and which has, e.g., cation transport, signal transduction, or protdn binding 
activity. 

Antibodies can be prepared against spedfic epitopes or domains of TRPCC, e.g., 
amino adds 2-30, 773-789, 870-887, 905-913, 943-958, 969-986, 1005-1022, 1087-1114, 
1125-1131, 789-870, 913-943, 986-1005, etc. 

20 Detection can be desirable for a variety of different purposes, including research, 

diagnostic, prognostic, and forensic. Diagnostic purposes included testing patients and their 
fiimilies for the presence of mutations assodated with hypomagnesemia with hypocalcemia 
or amyotrophic lateral sclerosis with frontotemporal dementia. The selected mutant alleles, 
SNPs, polymorphisms, etc., can be used diagnostically to detennine whether a subject has, or 

25 is susceptible to a disorder assodated with TRPCC, as well as to design therapies and predict 
the outcome of the disorder. Methods involve, e.g., diagnosing a disorder associated witii 
TRPCC or detennining susceptibility to a disorder, e.g., hypomagnesemia with hypocalcemia 
or amyotrophic lateral sclerosis with frontotemporal dementia, comprising, detecting the 
presence of a mutation in a TRPCC gene (such as a mutation in SEQ ID NO 16, or variants 

30 The sequences of TEIPCC genes can also be compared, e.g., between a normal gene 

as shown in SEQ ID NO 16 and the sequence of a gene from a patieaat with the disorder, e.g.. 
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hypomagnesemia with hypocalcemia. 

Fragments specific to TRPCC can also be used, e.g., to produce antibodies or other 
inmiime responses, as competitors to nucleotide binding, Ugand binding, etc. or as, e.g., 
inhibitors or stimuli in signal transduction pathways. These fragments can be referred to as 

5 being "specific for" TRPCC. The latter phrase, as already defined, indicates fliat the peptides 
are characteristic of TRPCC, and that the defined sequences are substantially absent fixwn all 
other protein types. Such polypeptides can be of any size necessary to confer specificity, e.g., 
5, 8, 10, 12, 15, 20, etc. Examples of polypeptides include but are not limited to polypeptides 
that comprise the following amino acid residues: 2-60, 598-660 of SEQ ID NO 17, or 

1 0 fragments thereof. 

Biological activities of TRPCC include, e.g., cation channel activity, signal 
transduction activity, and protein binding activity. As discussed above, the biological activity 
of TRPCC can be measured routinely. For example, if agents are to be identified which 
modulate the chaimel activity of TRPCC either electrophysiology or calcium unaging can be 

1 5 used to assess their effects, e.g., using fluo-3, Fura-red, Ca-sensitive chemi-luminescent 
probes, etc. (e.g., kits are commercially available from Molecular Probes) and a laser 
scanning confocal microscope to visualize the changes in intracellular calcium as a result of 
modulation of TRPCC. 

A transgenic animal, or animal cell, lacking one or more fimctional TRPCC genes can 

20 be useM in a variety of applications, including, as an animal model for hypomagnesemia 
with secondary hypocalcemia, amyotrophic lateral sclerosis with fix>ntotemporal dementia, 
etc., drug screening assays (e.g., for signal transduction mediated by agents other than 
TRPCC; by making a cell deficient in TRPCC, the contribution of other receptors to, e.g., 
Ca^^ modulation can be specifically examined), as a source of tissues deficient in TRPCC 

25 activity, etc. Such an animal can show a defect in cation (e.g., calcium) conductance, e.g., an 
impairment in the permeation of an ion through the channel. 

An isolated polynucleotide can comprise, e.g., a polynucleotide sequence coding 
wifliout interruption for a human TRPCC polypeptide, or complem^t thereto, said TRPCC 
having 90% or more amino acid sequence identity along its entire length to the sequence 

30 comprising amino adds 1-690 of SEQ ID NO 17, and 90% or more amino add sequence 
identity along its entire length to the sequence comprising from amino adds 691-1707 of 
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SEQ ID NO 17, and which has cation transport activitjr. 

The present invention also relates to a methods of identifying a mutation associated 
with amyotrophic lateral sclerosis with j&ontotemporal demaitia, comprising: comparing the 
stmcture of: genomic DNA comprising all or part of human TRPCC, mRNA comprising all 
5 or part of human TRPCC, cDNA comprising all or part of human TRPCC, or a polypeptide 
comprising all or part of human TRPCC, with the complete structure of human TRPCC as set 
forth in SEQ ID NO 16, in a patient having amyotrophic lateral sclerosis with frontotemporal 
dementia, or a family member thereof 

10 Melanocortin 

The present invention relates to novel forms of a melanocortin- 1 receptor (also known 
as *TvICR-l" or alpha-melanocyte stimulating hormone receptor). It is highly expressed in 
melanocytes, and is a key component of the pathway which modulates skin and hair 
pigmentation. Moreover, certain alleles of MCR-1 are associated with a higji risk of 

15 melanoma. MCR-1 is also expressed in other tissues, including monocytes, mast cells, 
placenta, pituitary, and endothelial cells. 

MCR-1 belongs to the G-protein coupled receptor (GPCR) super-family. Its 
expression is restricted to melanocytes and few other cell types, such as monocytes, mast 
cells, and endothelial cells. See, e.g.. Smith et al., Gene, 281 :81-94, 2001; Scholzen et al., 

20 Annals of the New York Academy of Sciences, 885:239-253 (1999). Stimulation of the 
receptor by its natural ligands (e.g., alpha-melanocyte stimulating hormone or "a-MSH") 
causes an increase in cAMP levels which, in turn, stimulates intracellular tyrosinase activity. 
Licreased activity of the tyrosinase enzyme drives the conversion of phaeomelanin CycUow 
and red pigments) to eumelalanin (brown and black pigments). 

25 The MCR-1 gene is located at chromosomal position 1 6q24. It is adjacent to the 

tubulin TUBB4 gene, and its 3' region overlaps with the tubulin promote (Smith et al.). 
Transcripts containing genie material from both MCRl and TUBB4 have been identified, 
including transcipts which contain coding sequences from both. See, e.g., NCBI accession 
number BC020171. These may be involved in cancer. 

30 Almost 40 different polymoiphisms in the MCR receptor have been iden^^ See, 

Sturm et al.. Gene, 277:49-62, 2001; Table 1. Several of these (e.g., ArglSlCys; Argl60Tip; 
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Asp294His) are strongly assodated wifli red hair, feir skin, and poor tanning ability. It has 
been reported that these alleles are nonfunctional receptors and do not stimulate cAMP 
production when stimulated by MSH. See, Table 2. As a consequence, phaeomelanin is not 
converted to eumelalanin, and skin and hair color reflect the cell's high content of the yellow 

5 and red phaeomelanin pigments. Significantiy, mdividuals who have these alleles are also at 
a higher risk for skin cancers, such as basal cell carcmoma, squamous cell carcinoma, and 
melanoma. See, e.g., Sturm et al.. Am. J. Hum. Genet., 6 (supplement to volume 67): 16, 
Oct 2000. See, also OMIM, No. 155555 for other information on MCR-1, including disease 
information, polymorphisms, etc. 

10 The present invention relates to novel forms of MCR-1 . hi one embodiment, the 

present invention relates to a novel MCR-1 variant, MCR-1 type C or MCR-IC, which 
possesses a unique carboxy-terminus. Previous reports had identified a 3 17 anuno add form 
of MCR-1 C*MCR-1 A") in a number of different species, mcluding human (SEQ ID NO 30), 
chimpanzee, muskox, sheep, cow, horse, dog, and fox. This form was characterized as fiill- 

15 length. Several minor size variants were observed, as well, e.g., in mouse (315 amino add 
adds), in pig (310 amino acids), and in cow (321 amino adds). A second form, MCR-IB, 
was (SEQ ID NO 31) also reported that had an additional 65 amino acids at its taminus (Tan 
et al., FEBS Letters, 451:137-141, 1991; WO 00/39147). The present invention relates to a 
third form of MCR-1 (MCR-IC) that comprises 32 carboxy-amino acids (amino acids 367- 

20 398 of SEQ ID NO 26) not previously identified m any melanocortin receptor variant. This 
novel form comprises part of the new carboxy terminus identified m MCR-IB, but diverges 
firom it at amino acid position 367. See, Fig. 6. 

Fig. 7 shows exons which have been detected in melanocortin-1 receptors. Exons 1, 
2, and 3 contain MCR coding sequences; Exons 5, 6, and 7 contain tubulin coding sequences. 

25 MCR-1 A (e.g., NM_002386: SEQ ID NO 30) contains exon 1, and MCR-IB contains 

exons 1 and 2. MCR-IC contains coding sequence from exons 1-3. As indicated by the stop 
codon TGA, exon 3 comprises both coding and noncoding sequence. MCR-IC can also 
contain noncoding sequences, e.g., exons 4, 5, 6, and/or 7 (e.g., for a total of exons 1-7). 
BC020171, mentioned above, contains the coding sequence fix>m exons 1 and 2 fiised to the 

30 coding sequmces of tubulin in exons 5-7. Examples of promoters for MCR-IC include, e.g., 
SEQIDNOS 35-37. 



wo 03/085095 PCT/US03/09921 

-13- 

The piesCTt invention also relates to a polymorphism at amino add position 120, 
where an isoleucine Q) is replaced witii a threonine (T). Isoleucme is present at amino add 
position 120 in most melanocortin receptor-1 homologs, exc^t pig which has a methionine 
substitution. This polymorphism may affect the recq)tor'sfunctioxiaHty^ Analysis of the 

5 transmembrane structure using TMHMM v. 2.0 (Krogh et al., Journal of Molecular Biology^ 
305(3):567-580, January 2001; Sonnhammer et al., In J. Glasgow et al., editors. Proceedings 
of the Sixth International Conference on Intelligent Systems for Molecular Biology, pages 
175-182, Menlo Park, CA, 1998. AAAI Press; MoIIct et al., Bioinformatics, 17(7):646-653, 
July 2001) reveals a different number of predicted transmembrane sequences than the 

1 0 isoleucine isoform. 

The present invention relates to any polynucleotide, or polypeptide encoded thereby, 
which codes for MCR-IC, including receptors having any polymorphism, such as the 
naturally-occurring polymorphisms listed in Tables 1 and 2, and tiiose disclosed herein. 
Examples include SEQ ID 27 having 120T and 163Q, SEQ 28 having 1201 and 163R, and 

15 SEQ ID 29 having 1201 and 163Q. It also includes polynucleotide and polypeptide fragments 
which are specific for MCR-IC (e.g., 367-398 of SEQ ID NO 26 and fragments thereof), and 
polynucleotides and polypeptides which comprise such specific fragments. For example, the 
present invention relates to a polynucleotide comprising a coding sequence for amino add 
367-398 of SEQ ID NO 26, or fragments thereof, such as any five amino acid sequence 

20 contained therein. 

The present invention also relates to an isolated polynucleotide comprising, a 
polynucleotide sequence coding without interruption for a human MCR-IC, said MCR-IC 
having about 80%, 85%, 88%, 89%, 90%, 92%, 95%, 99%, etc., or more amino acid 
sequence identity along its entire length to the amino acid sequence set forth in SEQ ID NO 

25 26, or a complement thereto, and which has ligand-binding activity, G-proteia binding 

activity, or cAMP production activity. For example, such a polynucleotide can comprise one 
or more of the polymorphisms listed in Tables 1 and 2 (e.g., if 36 of the listed 
polymorphisms were present in such a polynucleotide, it would have about 90% (360/398) 
sequence identity along its entire length to the amino add sequence of SEQ ID NO 26* A 

30 corresponding amount of nucleotide is included, e.g., 90%, 92%, 95%, 97%, 98%, 99%, or 
more. 
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Similarly, the present inveation relates to an isolated polynucleotide comprising a 
polynucleotide sequence coding without interruption for a human MCR-IC, or complement 
thereto, said MCR-IC having 80%, 84%, 85%, 86%, 88%, 90%, 95%, or more amino add 
sequence identity along its entire length to the sequence comprising amino adds 1-3 16 of 
5 SEQ ID NO 26, and 85%, 90%, 92%, 95%, etc. or more amino acid sequence identity along 
its entire length to the sequence comprising from amino adds 317-398 of SEQ ID NO 26, 
and which has ligand-hinding activity, G-protein binding activity, or cAMP production 
activity. 

As stated, a polynucleotide can code for a polypeptide having one or more of the 

10 following activities, ligand-binding activity, G-protdn binding activity, cAMP production 
activity, or other functional activities. Ligand-binding activity indicates the ability of MCR- 
IC to bind spedfically to a receptor ligand, such as a ACTH, MSH, etc. Ligand binding 
activity can be using a radioactive or otherwise labeled ligand, or whole-cell assays using 
labeled ligands. See, e.g., WO0039147, W09957148, U.S. Pat. Nos. 5,731,408, 6,100,048, 

15 and 6,350,760; Libert et al.. Pigment Cell Res., 2:510-518, 1989. 

G-protein binding activity indicates the ability of the receptor protein to bind to a G- 
protein. Such binding does not determined routinely, e.g., using filtration assays necessarily 
have to be productive, i.e., the binding does not have to result in stimxilation of the signal 
transduction cascade. G-protein binding can be measured using in vivo and in vitro binding 

20 assays, as well as functional assays. See, e.g.. Ford et al.. Science^ 280:1271-1274, 1998. 
cAMP production is a measure of the ability of the receptor to stimulate the 
generation of cAMP upon binding by a receptor agonist. MCR is known to couple to G- 
proteins and thereby activate adenyl cyclase, increasing intracellular levels of cAMP (e.g., 
Buckley & Ramachandran, Proc, Natl Acad. ScL, 78: 7431-7435, 1981; Grahame-Smith et 

25 al., 1967, J. Biol Chem. 242: 5535-5541; Mertz & Catt, 1991, Proc. Natl Acad Sci. 88: 
8525-8529; Pawalek et al., 1976, Invest Dermatol 66: 200-209). This property of cells 
e5q)ressing the MCR-IC can be used assess its "cAMP production activity." For example, 
cells can be transfected with MCR-IC DNA, plated, and washed once with DMEM 
containing 1% bovine serum albumin (BSA) and 0.5 mM IBMX (a phosphodiesterase 

30 inhibitor). The cells can then be treated with hormone (e.g., alpha-MSH, gamma-MSH, 

ACTH, etc.). Following hormone treatment, the cells can be washed wifli phosphate buffered 
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saline, and intracellular cAMP extracted by lysing the cells. Intracellular cAMP 
concentrations can be determined routinely, e.g,, using an assay (Amersham) which measures 
flie ability of cAMP to displace cAMP ftom a high afiBnity cAMP binding protein (see 
Gihnan, 1970, Proc. Natl Acad, ScL, 67: 305-312). 

5 Polynucleotide and polypeptides of the presCTit invention can be used for a variety of 

purposes, including, but not limited to, treating cancers, treating skin cancer and other 
cancers modulating skin and hair pigmentation, identifying MCR Ugands, modulating the 
MCR-1 receptor types, determining susceptibility to skin cancer, detecting MCR-IC 
expression, determining polymorphisms in MCR-IC, making MCR-IC polypeptide, 

10 expressmg MCR-1 C in host cells, making antibodies to MCR-1 receptor types, modulating 
cutaneous inflammation (see, e.g., Bhardwaj et al., J. Immunol^ 158:3378-3384, 1997; Luger 
et al.,^/i. NY Acad. ScU 917:232-238, 2000), modulating melanocytes, monocytes, 
endothelial cells, or other cells in which MCR-IC is expressed, etc. 

The expression of MCR-IC on the surface of melanoma cells makes it a useful target. 

1 5 Melanoma is a skin cancer which originates from melanocytes present normally in the 

epidermis and underlying cell layers. There are four basic types: lentigo maligma melanoma, 
siq>erficial spreading melanoma, nodular melanoma, and acral lentigous melanoma. Because 
of its expression on melanocj^es, MCR-IC specific antibodies and other binding partners can 
be used to treat melanoma, e.g., by conjugating cytotoxic agents to antibodies directed to flie 

20 receptor. In addition, MCR-1 C polynucleotides, polypeptides, and binding partners thereto 
can be used to detect metastatic melanoma cells. 

Modulation of the MCR-IC can also be used to modulate skin pigmentation, e.g., to 
increase the amount brown and black pigments to darken sidn color, to provide protective 
effects against UV radiation, to block receptor activation, e.g., preventing or reducing the 

25 accumulation of brown and black pigments in the skin, preventing or reducing tanning, 
preventing or reducing skin freckling, etc. Agonists and antagonists of the melanocortin 
receptor, include, alpha-melanocyte stimulating hormone and adrenocorticotropic hormone. 
Other Ugands are disclosed in, e.g., W09957148, U.S. Pat. Nos. 5,731,408, 6,100,048, and 
6,350,760, and can be identified and isolated as described in these patents, as weU as 

30 WO0039147. 

As discussed earlier, several MCR-1 alleles have been associated with a greater risk 





PCTAUS03/09921 



WO 03/085095 



-16- 



of skin cancer. For example, the presence of the Asp84Glu variant imposed a high risk of 
melanoma in individual carriers. See, Kennedy et al., J, Im^est. Dermatol.^ 1 17:294-300, 
2001 . Otiier alleles with increased risk of melanoma included, Val60Leu, Val92Met, 
Argl42ffis, ArglSlCys, Argl60Trp, Argl63Ghi, andHis260Pro (Kemiedy et al.). See, also, 
5 Scott et al., J. Cell. ScL, 1 15 (PL 1 1):2349-2355, 2002. MCR-IC of the present invention can 
be used to assess melanoma risks, e.g., determining the presence of a variant of MCR-IC in 
individuals, and whethCT such variants are associated with skin cancer and other melanocyte 
disorders. Analysis can be performed by any suitable method, e.g., by single-stranded 
conformation polymorphism analysis and DNA sequence analysis. 

1 0 Expression can also be "selective," where expression is observed. By the phrase 

"selectively expressed," it is meant that a nucleic acid molecule comprising the dejSned 
sequence of nucleotides, when produced as a transcript, is characteristic of the tissue or cell- 
type in which it is made. This can mean that the transcript is expressed only in that tissue 
and in no other tissue-type, or it can mean that the transcript is expressed preferentially, 

15 differentially, and more abundantly (e.g., at least 5-fold, 10-fold, etc., or more) in that tissue 
when compared to other tissue-types. 

In view of their selectivity and display on the cell surface, MCR-IC polypeptides of 
the present invention are a useful target for histological, diagnostic, and therapeutic 
appUcations relating to the cells in which they are expressed. Antibodies and other protein 

20 binding partners (e.g., ligands, aptamers, small peptides, etc.) can be used to selectively target 
agents to a tissue for any purpose, included, but not limited to, imaging, therapeutic, 
diagnostic, drug delivery, gene therapy, etc. For example, binding partners, such as 
antibodies, can be used to treat melanomas in analogy to how c-erbB-2 antibodies are used to 
breast cancer. They can also be used to detect metastatic cells, in biopsies, etc. The g^es 

25 and polypeptides encoded thereby can also be used in tissue engineering to identify tissues as 
they appear during the differentiation process, to target tissues, to modulate tissue growth 
(e.g., from starting stem cell popxilations), etc. Useful antibodies or other binding partners 
include those that are specific for parts of the polypeptide which are exposed extracellularly. 
Any of the methods described above and below can be accomplished in vivo, in vitro, or ex 



30 



vivo. 



Binding partnars can also be used as to specifically deliver therapeutic agents to a 
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tissue of interest For example, a gene to be delivered to a tissue can be conjugated to a 
binding partner (directly or through a polymer, etc.), in liposomes comprising cell surface, 
and then administered as appropriate to the subject who is to be treated Additionally, 
cytotoxic, c^ostatic, and other therapeutic agents can be delivered specifically to the tissue to 

5 treat and/or prevent any of the conditions associated witit the tissue of interest. 

The present invention relates to methods of detecting melanoma cells, comprising one 
or more of the following steps, e.g., contacting a sample comprising cells with a 
polynucleotide specific for MCR-IC (e.g., amino acids 367-398, and firagments thereof), or a 
mammalian homolog thereof, under conditions eflBective for said polynucleotide to hybridize 

1 0 specifically to said gene, and detecting specific hybridization. Detecting can be 

accomplished by any suitable method and technology, including, e.g., any of those mentioned 
and discussed below, such as Northern blot and PGR. Specific polynucleotides include SEQ 
ID NOS 32-34, and complements thereto. 



15 melanocytes, e.g., for diagnostic, therapeutic, and prognostic purposes, including the 

treatment of melanoma. Methods of delivering an agent to a melanocyte cell can comprise. 



melanocortm receptor gene of the present invention, whereby said agent is delivered to said 
cell. Any^eof agent can be used, including, therapeutic and imaging agents. Contact with 

20 the melanocyte (e.g., a melanoma) can be achieved in any effective manner, including by 

administering effective amounts of the agent to a host orally, parentally, locally, systemically, 
intravenously, etc. The phrase "an agent coupled to binding partner*' indicates tiiat the agent 
is associated with the binding partner in such a manner that it can be carried specifically to 
the target site. Coupling includes, chemical bonding, covalent bonding, noncovalent bonding 

25 (where such bonding is sufficient to carry the agent to the target), present in a liposome or in 
a lipid membrane, associated with a carrier, such as a polymeric carrier, etc. The agent can 
be directiy linked to the binding partner, or via chemical linkers or spacers. 

Tm Aging of specific organs can be facilitated using tissue selective antibodies and 
other binding partners that selectively target contrast ag^ts to a specific site in the body. 

30 Various imaging techniques have been used in this context, including, e.g.. X-ray, CT, CAT, 
MRI, ultrasound, PET, SPECT, and sdntographic. A reporter agent can be conjugated or 



As indicated above, binding partners can be used to deliver agents specifically to 



e.g., contacting a melanocyte with an agent coupled to binding partner specific for a 
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/ associated routinely wifli a binding partner. Ultrasound contrast agents combined with 
binding partners, such as antibodies, are desoibed in, e.g., U.S. Pat Nos. 6,264,917, 
6,254,852, 6,245,318, and 6,139,819. MRI contrast agents, such as metal chelators, 
radionucleotides, paramagnetic ions, etc., combined with selective targeting agents are also 
5 described in the Uterature, e.g., in U.S. Pat Nos. 6,280,706 and 6,221,334. The methods 
described therein can be used generally to associate a partner with an agait for any desired 
purpose. 

A transgenic animal with a disrupted melanocortm-lC receptor can have a 
pigmentation phenotype, e.g., red or fair hair. Functional disruption of the gene pan be 

1 0 accomplished in any effective way, including, e.g., introduction of a stop codon into any part 
of the coding sequence, e.g., to prevent expression of amino acids 367-398, such ttiat the 
resulting polypeptide is biologically inactive or lacks one or more of its functional regions, 
introduction of a mutation into a promoter or other regulatory sequence that is effective to 
tum it off, or reduce transcription of the gene, insertion of an exogenous sequence into the 

15 gene which inactivates it (e.g., which disrupts the production of a biologically-active 

polypeptide or which disrupts the promoter or other transcriptional machinery), deletion of 
sequences from the gene, etc. A transgenic animal, or animal cell, lacking one or more 
functional genes of the present invention can be useful in a variety of applications, including, 
as an animal model for conditions and diseases associated with melanocortin-1 C, for drag 

20 screening (e.g., by making a cell deficient in MCR- 1 C, the contribution of flie activity 
remaining variants, such as MCR-IB and the 3 17-amino acid form, can be assessed), as a 
source of tissues deficient in one or more MCR-1 activities. The animal's endogenous locus 
can be replaced with a continuous coding sequence for MCR-IC, such that only MCR-IC is 
expressed, and no other form, such as MCR-IB and the 317-amino acid form, are expressed. 

25 PGR based methods can also be used in the methods of detecting polynucleotides for 

human MCR-IC. In such methods, more than one probe specific for MCR-IC can be used, 
e.g., a pair of specific polynucleotide probes which are capable of amplifying a 
polynucleotide sequence of MCR-IC, such as corresponding to amino adds 1-366, 367-398, 
etc., of SEQ ID NO 26. For instance, SEQ ID NO 32 is in exon 1, SEQ ID NO 33 spans 

30 exons 2-3, and SEQ ID NO 34 is in exon 4. Thus, in a PCR reaction, SEQ IDS 32 and 33 
produce a firagment about 262 base pairs that is absent in MCR-1 A and MCR-IB. SEQ ID 
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NOS 32 and 34 in a PGR reaction produce a fiagment of about 615 base pairs which is absent 
from MCR-1 A and MCR-IB. 

Mutant alleles, polymorphisms, SNPs, etc., can be identified and isolated from 
melanomas and other skm conditions that are known, or suspected to have, a genetic 
5 component Identification of such genes can be carried out routinely (see, above for more 
guidance), e.g., using PGR, hybridization techniques, direct sequencing, mismatch reactions 
(see, e.g., above), RFLP analysis, SSCP (e.g., Orita et al., Proc. Natl Acad, Scl, 86:2766, 
1992), etc., where a polynucleotide having a sequence selected firom SEQ ID NO 25 
(especially corresponding to amino acids 367-398) can be used as a probe. The selected 

1 0 mutant alleles, SNPs, polymorphisms, etc., can be used diagnostically to determine whether a 
subject has, or is susceptible to a melanoma or other condition (e.g., pigmentation variation, 
inflammatoiy condition) associated with a melanocortin receptor gene of the present 
invention, as well as to design therapies and predict the outcome of the disorder. 

The present invention an isolated polynucleotide comprising, a polynucleotide 

1 5 sequence coding without interruption for a human MCR-1 C, or complanent thereto, said 
MCR-IC having 84% or more amino acid sequence identity along its entire length to the 
sequence comprising anoino acids 1-316 of SEQ ID NO 26, and 90% or more amino acid 
sequence identity along its entire length to the sequence comprising from amino acids 317- 
398 of SEQ ID NO 26, and which has ligand-binding activity, G-protein binding activity, or 

20 cAMP production activity. 



OTB860 

OTB860 codes for an intracellular polypeptide comprising 1700 amino acids. Its 
nucleotide and amino acid sequences are shown in SEQ ID NOS 38 and 39. Expression of 
25 OTB860 is detected predominantly in heart and brain tissues, with minimally detectable 
levels in breast and testes tissues. The expression pattern is illustrated in Fig. 8. 

As shown in Fig. 9, OTB860 is related to KIAA1678 (also known as AB051465; 
SEQ ID NO 42). It contains 369 amino adds at its N-terminal end which are not present in 
KIAA1678, and a 29 amino acid insertion (1546-1574) at about amino add position 1545. It 
30 also differs at amino add positions 847 (histidine instead of gMainine) and 867 (arginine 
instead of glutamine). 
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Consistent with the expression pattern of OTB860, there are a number of functional 
and developmental pathways that are shared betwem neuronal and cardiac cells. For 
example, neuregulins (neuregulin-1 — NRGl) play an important role in myocardial and 
neuronal development Nfice deficimt in Igl^domain containing neuregulins have severe 

5 defects in the developing heart and nervous system. See, e.g., Kramer et al., Proc. Natl 

Acad. ScL, 93: 4833-4838, 1996; Zhao et al., J. Biol. Chem., 273:10261-10269, 1998. These 
effects appear to be mediated by Type I neuregulins. Meyer et al.. Develop, 124:3575-3586, 
1997. The neuroregulin receptor, erbB4, is also highly expressed in heart and brain, although 
its expression is not restricted to these tissues (data not shown). Adhesion pafliways are also 

1 0 shared between flie two tissues. For example, alpha4-integrin is expressed in both neuronal 
and cardiac cells. See, e.g., Pmco et al., Mech. Dev., 100:99-103, 2001. 

OTB860 maps to chromosomal band 2q36. A number of disorders have been mapped 
to, or in close proximity to, this chromosome location. These include, e.g., brachydactyly 
type Al, pili torti and nerve deafiiess, syndactyly type 1, Gracile syndrome (growth 

15 retardation and early death), and epilepsy. Recently, in a whole genome scan, 2q36 was 
identified as a lociis associated with acute coronary syndrome (Harrap et al., Arterioscler. 
Thromb. Vase. Biol., 22:874-878, 2002), involving myocardial infarction, unstable angina, 
atherosclerotic plaque disruption, and coronary thrombosis. Nucleic acids of the present 
invention can be used as linkage markers, diagnostic targets, therapeutic targets, for any of 

20 the mentioned disorders, as well as any disorders or genes mapping in proximity to it. 

OTB860 can be tised in diagnostic, therapeutic, prophylactic, and research 
appUcations. RNA and polypeptide detection methods can be used to determine whether a 
sample comjwises neuronal or cardiac tissues. When a positive is obtained, cell type markers 
can be used to determine precisely whether the tissue is neuronal or cardiac. For example, 

25 the presence or absence of a neuronal marker would distinguish between brain and heart 
tissues. Non-limiting examples of neuronal markers include, presenilins, genes and 
polypeptides in the pathways for neurotransmitter synthesis, recqptor, metabolism, etc., (e.g., 
serotonin, MAO, dopamine, norephinephrine, nitric oxide, etc.), apoUpoprotein A, APP, 
neuron-specific enolase (NSE), glial fibrillary acidic protem (GFAP), SlOO, GAP-43, neuron- 

30 specific beta-m tubulin, Stac (neuron-specific protein with an SH3 domain, e.g.. Genomics, 
47:140-2, 1998), myelin basic protem, vimentin, etc. Non-limiting examples of heart tissue 
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markeis include, caidiac troponin I, and smooth muscle maikers such as CRP 1 . 

The piesent invention also relates to polypeptide detection methods for assessing 
heart or brain function, e.g., methods of assessing heart or brain function, comprising, 
detecting OTB860 polypeptide, or fragments thereof in a body fluid, whereby the level of 

5 OTB860 polypeptide in said fluid is a measure of heart or brain function* Heart or brain 
function tests are xisually performed to determine whether the organ is functioning normally 
as a way of diagnosing disease. Various tests are used to test for heart function, such as 
electrocardiogram, stress test, echocardiogram, oxygen levels, and cardiac enzyme tests (e.g., 
creatine phospholdnase, troponin, lactate dehydrogenase, and myoglobia). Detection of 

10 OTB860 provides an additional assessment tool, specially m diseases such as myocardial 
infarction and other conditions, e.g., where cellular debris, etc., is released systemically. As 
with tiie other tests, elevated levels of OTB860 in blood, or other fluids, can indicate 
impaired brain or heart function. Values can be determined routinely, as they are for other 
functional markers. 

1 5 OTB860 polynucleotides and polypeptides can be used to treat, prevent and diagnose 

diseases of the heart, including, e.g., acute coronary syndrome, myocardial hypertrophy, heart 
failure, conduction disordres, arrhythmias, bradyarrhyfhmias, sinus node dysfunction, 
tachyarrhythmias, tachycardias, atrial fibrillation, congenital heart diseases (see, e.g., 
Harrison's Principles of Internal Medicine, Volume 1, 12*^ Edition, 1991, Pages 924-925), 

20 atrial and ventricular septal defects, congenital aortic stenosis, coartation of the aorta, 
valvular heart disease, myocardial infarction, ischemic heart disease, cardiomyopathy, 
pCTOcardial diseases, cardiac tumors (e.g., myxoma, Ipotna, papillary fibroelastoma, 
rhabdomyoma, sarcoma, etc), coronary artery disease, atherosclerosis, aortic aneurysm, etc. 
OTB860 polynucleotides and polypeptides can also be used to treat, prevent and 

25 diagnose diseases of the brain, including, e.g., vascular diseases, hypoxia, ischemia, 
infarction, tumors, neuroglial tumors, astrocytoma, glioblastoma multifome, pilocytic 
astrocytoma, oligodendroglioma, ependymona, choriod plexus papilloma, neuronal tumors, 
neuroblastoma, ganglioneuroma, gangliocytoma, gangliogioma, primitive or undifferentiated 
tumors, meduUoblastoma, tumors of meninges, mingioma, lymphomas, demyelinating 

30 diseases, multiple sclerosism perivenous encephalitis, degenerative diseases, Alzhenner's, 
Pick's, Huntmgton's. Parkinsonism, ALS, Werdnig-Hoffinan, degenerative diseases of 
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cerebral cortex, ganglia, brainstem, and motor neurons, inborn errors of metabolism, 
dCTQyelinating and dysmyelinating disorders, Pelizaeus-Merzbacher disease, multiple 
sclerosis, various leukodystrophies, post-traumatic demyelination, cerebrovascular (CVS) 
accidents, neuritis, neuropathies, particularly, multifocal leucoencephalopalhy, Guillain-Baire 

5 syndrome, retrobulbar neuritis, acute rubella encephalitis, chronic relapsing polyneuropafliy, 
intravascular lymphomatosis, Krabbe disease, globoid cell leukodystrophy, subacute 
combined degeneration of the spinal cord and brain, allergic encephalitis, murine 
caronavirus, hepatitis virus infection, or Theiler's murine encephalomyelitis, prion diseases, 
Creutzfeldt-Jakob, especially, febrile famiUal convulsions, epilepsy, vascular neuromyopafliy, 

1 0 cerebellar ataxia, etc. 

When expression is described as being **predominantly" in a given tissue, this 
indicates that the gene's mRNAs levels are highest in this tissue as compared to the other 
tissues in which it was measured. Expression can also be "selective," where expression is 
observed. By the phrase "selectively expressed," it is meant that a nucleic acid molecule 

1 5 comprising the delBned sequence of nucleotides, when produced as a transcript, is 

characteristic of the tissue or cell-type in which it is made. This can mean that the transcript 
is expressed only in that tissue and in no otiier tissue-type, or it can mean that the transcript is 
expressed preferentially, differentially, and more abundantiy (e.g., at least 5-fold, 10-fold, 
etc., or more) in that tissue when compared to other tissue-types. 

20 The present invention relates to methods of detecting brain or heart cells, comprising 

one or more of the following steps, e.g., contacting a sample comprismg cells wifli a 
polynucleotide specific for OTB860 (e.g., SEQ ID NOS 40-41), or a mammalian homolog 
thereof, under conditions effective for said polynucleotide to hybridize specifically to said 
gene, and detecting specific hybridization. Detecting can be accomplished by any suitable 

25 method and technology, including, e.g., any of those mentioned and discussed below, such as 
Northem blot and PGR. Specific polynucleotides include SEQ ID NOS 40-41, and 
complements thereto. 

Detection can also be achieved using binding partners, such as antibodies (e.g., 
monoclonal or polyclonal antibodies) that specifically recognize polypeptides coded for by 

30 genes of the present inventioiL Thus, the present invention relates to methods of detecting a 
brain or heart cell, comprising, one or more the following steps, e.g. contactuig a sample 
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comprising cells with a bindmg partner (e.g. an antibody, an Fab fiagmoit, a single-diain 
antibody, an aptamec) specific for a polypeptide coded for by OTB860 (e.g., SEQ ID NO 39), 
or a tnainnialiaii homolog theteol^ under conditions effective for said binding partner bind 
specifically to said polypeptide, and detecting specific binding. Protein binding assays can be 

5 accomplished routinely, e.g., using immunocytochemistiy, ELISA format. Western blots, etc. 
Usefiil epitopes include those exposed to the surfece. 

A brain or heart cell (see above for examples of brain or heart cell types) can also be 
modulated in accordance with the presoxt invention, e.g., by methods of modulating a brain 
or heart cell, comprising, e.g., contacting said cell with an agent effective to modulate 

10 OTB860, or ihe biological activity of a polypeptide encoded therdiy (e.g., SEQ ID NO 39), 
or a mammalian homolog thcaieof, whereby said brain or heart cell is modulated. Modulation 
as used throu^out includes, e.g., stimulating, increasing, agonizing, activating, ampli^g, 
blocking, inhibiting, reducing, antagonizing, preventing, decreasing, diminishing, etc. Any 
activity or fimction of the brain or heart cell can be modulated, including, e.g., development, 

15 differentiation, signaling, excitability, etc. 

The present invention also relates to methods of modulating development of cardiac 
or neuronal cells, comprising, e.g., administering an agent which is effective for modulating 
the expression of OTB860, or the biological activity of a polypeptide encoded thereby, 
whereby the development of said cardiac or neuronal cell is modulated. Development is 

20 meant to include any process in which a cell or tissue matures, including differentiation, 
organogenesis, cell proliferation, cell survival, expression and induction of fimctional 
molecules, cell movement and migration, apoptosis, modulation of gene expression, 
trabeculation, etc. Examples of heart and brain development that can be modulation are 
disclosed in Kramer et al., Proc. Natl. Acad. ScL, 93:4833-4838, 1996. Any agent can be 

25 used to modulate development in any environment, e.g., in situ, in vivo, or in vitro. 

OTB860 can be used to detect, modulate, etc., any of the cell types present in the 
heart or brain, including, but not linuted to, heart cells comprising the coverings of the heart 
(e.g., pericardium, fibrous pericardium, serous pericardium containing the parietal layar and 
epicardium), heart wall comprising mycardium, cardiac muscle, and endocardium 

30 (endothelial), blood vessels, valves, and autorhythmic cardiac cells such as those in the SA 
and VA nodes, brain and other neuronal cells, such neurons, glia, microglia, epen^raial cells. 
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oligodendrocytes, Schwann cells, and satellite cells. 

Promoter sequences obtained fixim OTB860 can be utilized to selectively express 
heterologous genes in brain or heart cells. Methods of e^[>ressing a heterologous 
polynucleotide in brain or heart cells can comprise, e.g., expressing a nucleic acid construct 
5 in brain or heart cells, said construct comprising a promoter sequence operably liiiked to said 
heterologous polynucleotide, wherein said promoter sequence is obtained from OTB860, e.g., 
on genomic NT_0221 15.8. In addition to the cell lines mentioned below, the construct can 
be expressed in primary cells or in established cell lines. 

The present invraition also relates to methods of modulating development of cardiac 
10 or neuronal cells, comprising, e.g., administering an agent which is effective for modulating 
the expression of OTB60, or the biological activity of a polypeptide encoded therd)y, 
whoreby the development of said cardiac or neuronal cell is modulated. 

The present invention also relates to a mammalian cell whose genome comprises a 
functional disruption of the human 013860 gene within a polynucleotide sequence coding 
15 for amino add residues 1-369 (SEQ ID NO 39) or 1546-1574 (SEQ ID NO 39). A non- 
human, transgenic mammal comprising such a cell can have a heart or neuronal tissue defect 

Antibodies can be produced, e.g., an antibody which is specific-for: an epitope 
comprising amino acid 847, amino add 867, or an epitope contained witii amino adds 1-369 
(SEQ ID NO 39), or amino adds 1546-1574 (SEQ ID NO 39). 

20 

TARRP 

Human TARPP (thymocyte cydic AMP regulated phosphoprotein, or, Brl37A, B, C, 
D, and E) is represented by a femily of altranative splice variants. Figs. 10-12 summarize the 
differences between the multiple forms. Brl37E is an 847 amino add polypeptide. Its 

25 nucleotide and amino add sequences are shown in SEQ ID NOS 43 and 44. Brl37B(SEQ 
ID NO 47 and 48) has a deletion of amino adds 267-300, Brl37A (SEQ ID NO 45 and 46) 
has adeletion of amino adds 312-331, and Brl37C (SEQ ID NO 49 and 50) has a deletion of 
both these domains. Brl37D (SEQ ID NO 51 and 52) contains only tiie first 87 amino adds 
followed by a two-amino add N-tetminus whidi differs fiom the other forms. A partial 

30 clone, AL133 109 (SEQ ID NO 55) as shown in Fig. 10, is missing the first 161 amino adds 
of Brl37B, as well as having an amino add diffeacence at position 3 12 (SEQ ID NO 44). 
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Brl37E contains a nuclear localization signal at about amino adds 107-124, an R3H 
domain (sin^e-stranded nucleic add binding domain) at about amino acids 147-224, and a 
inoline ridi region at about amino adds 476-682. These domains are also present in the A-C 
splice fomis, but at diflFerent amino add positions. Human TARPP has nucldc add binding 
activity conferred by the corresponding binding domain indicating fliat it can bind nucldc 
adds, preferably single-stranded DNA or RNA. This binding activity can be assayed 
routinely, e.g., using gel electrophoresis band shift assays, e.g., as carried out in, e.g., U.S. 
Pat No. 6,333,407 and 5,789,538, EUSA-based assays (e.g.. Mercury™ TransFactor Kit 
ficom Clontech), and other assays which detect DNA-protein interactions. 

The Brl37 femily represent the human homologs of murine TARPP (thymocyte 
ARPP) (NM_033264; SEQ ID NO 53; "Mouse" in Fig. 12). Brl37E has about 83% amino 
add identity and 87% homology with it (calculated using the BLAST algorithm). See, Fig. 
12 (NM_033264 is murine TARPP). In addition to amino add sequeace differences 
between the two proteins, human TARPP has an insertion at about amino add positions 549- 
572 of SEQ ID NO 44 which is not present in the mouse protein. See, Fig. 12. 

Originally, a 21 kDa polypeptide was isolated from rat basal ganglia based on its 
phosphorylation by cAMP-dependent protein kinase (PKA). Williams et al., /. Neuroscu, 
9:3631-3637,1989. It was named ARPP-21 (cAMP-regulatedphosphoprotein). Activation 
of dopamine receptors resulted in an inorease in the phosphorylation of ARPP-2 1 . Caporaso 
et al., Neuropharm., 39:1637-1644, 2000. Human ARPP-21 (Brl37D) contains 89 amino 
adds (NM_016300; SEQ ID NO 52). 

A high molecular wdght polypeptide of ARPP-21 was subsequently identified in T- 
cells and named TARPP. Kisielow et al., £:«r. J. /inmimo/., 31:1141-1149, 2001. This 
polype5)tide contains ARPP-21 sequence at its 5' end, but a novel 3' end coding for more 
than 700 additional amino acids (for a total of 807 amino adds). Murine TARPP appears to 
be involved in the regulation of thymocyte maturation and TCR rearrangement Expression 
of TARPP is down-regulated after the TCR signals delivered. It is highly expressed in 
immature diymocytes and is assodated with tiie commitment to the T-ceU lineage, making it 
highly selective marks: for T-cell commitment See, Kisielow. ibid. After commitment to 
the T-cell lineage during positive selection, its ejqiression is turned off. 

There ^pear to be several members of the human TARPP femily. KIAA0029 is a 
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hypothetical protein that shares about 45% amino add sequence identity and 59% homology 

with Brl37E. KIAA1002, a second hypothetical protein, has about 42% amino acid identity 

and 54% homology to it 

Human TARPP is highly expressed in brain, pituitary, muscle, and thymus. It is 
5 expressed at lower levels in adrenal gland, bone marrow, heart, small intestine, kidney, liver, 

ovary, prostate, stomach, testis, and thyroid. There was virtually no detectable expression in 

colon, lung, lymph node, peripheral lymphocytes, mammary gland, pancreas, and uterus. 

As indicated by its expression pattern, human TARPP is involved the maturation of 

T-cells, especially in the rearrangement of the TCR. For this reason, it can be used to 
10 modulate T-cells, e.g., in allrargy, autoimmune disease (e.g., rheumatoid arthritis and multiple 

sclerosis), and graft-host disease. It can also be used a marker to deteraiine the index of 

mature versus iiomature T-cells, where human TARPP is marker of immature T-cells. 

Additionally, human TARPP is phosphorylated upon dopamine receptor activation, 

indicating an involvement in dopamine pathways. Consequently, it is target for diseases that 
15 involve dopamine, including, e.g., schizophrenia, substance abuse and addiction, anxiety, 

Parkinson's disease, and other dopaminergic diseases and conditions. 

Human TARPP is localized to chromosomal band 3p21.33. There are several 

disorders genetically mapped to this region, including, e.g., retinal vasculopathy with cerebral 

leukodystrophy (OMIM 192315), deafaess, neurosensory, autosomal recessive 6 (OMIM 
20 600971), and lung cancer. Nucleic acids of the present invention can be used as linkage 

markers, diagnostic targets, dierapeutic targets, for any of the mentioned disorders, as well as 

any disorders or genes mapping in proximity to it 

Diseases or disorders which can be treated in accordance with the present invention 

include, but are not limited to autoimmune disease, such as multiple sclerosis and rheumatoid 
25 arthritis, and allergy 

The gene can be disrapted in a specific region, e.g., in the sequence coding for amino 

acids 1-161 of a human TARPP. Cells and/or animals can also have targeted deletions, e.g., 

deletion of a coding sequeace for amino adds 267-300 and^or 312-331 of a human TARPP. 

One or more the different splice forms, Brl37A-E can also be knocked-out or dismpted, e,g., 
30 to dissect out the individual activities. 
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The present invention relates to mettiods of modulating T-cells, comprising, 
contacting T-cells with an agent which is effective for regulating a human TARPP gene 
expressed in said cells, or for modulating the biological activity of a polypeptide encoded 
thereby. 

5 Included also in the present invention are oigineered cells, e.g., a human cell whose 

genome comprises a functional disruption of human TARPP in the region comprising the 
coding sequence for amino acids 1-161 of ahuman TARPP of SEQIDN0 44, orahuman 
cell whose genome comprises a deletion of a coding sequence for amino acids 267-300 
and/or 312-331 of ahuman TARPP of SEQ ID NO 44. 

10 Antibodies can be produced, e.g., an antibody whidhi is spedfic-for a human TARPP, 

said antibody which is specific for an epitope present in amino acid sequoices 1-161 , 88-161, 
267-300, 3 12-33 1 , or a polypeptide comprising amino acid 312, of a human TARPP of SEQ 
ID NO 44. 



15 LAT-1 

Liver-associated transmembrane protein-l ("LAT-1" or TMD008) codes for a 
polypeptide comprising 276 amino acids. Its expression is highly restricted to die liver, i.e., 
it is predominantly expressed in the liver. The nucleotide and amino acid sequences of it are 
shown in SEQ ID NOS 58 and 59. It contains transmembrane domains at about amino acid 

20 positions 24-46, 59-81, 101-123, 144-166, 203-225, and 237-259. It is homologous to the 
ol&ctory class of GPCR receptors. LAT-1 is also known as XM_060456 and AX242289. 

The gene for LAT-1 maps to chromosomal band lq22. Several diffCTent disorders 
map to this location, including, e.g., porphyria variegata, progression of lymphoma, 
Zellweger syndrome, Charcot-Marie-Tooth neuropathy-IB, congenital hypomyelination, 

25 nemaline myopathy, and CD3 zeta chain deficiency, medullary thyroid carcinoma, 

susceptibility to Vivax malaria, schizophrenia susceptability locus, autosomal dominant 
deafiiess, susceptibility to Lupus nephritis, familial hemiplegic migraine, apolipoprotein A-II 
deficiency, and familial hyperlipidemia. Nucleic acids of the present invention can be used 
as linkage maikers, diagnostic targets, therapeutic targets, for any of the mentioned disorder^ 

30 as well as any disorders or genes ms^ing in proximity to it 

LAT-1 can be used as a diagnostic and prognostic marker for liver fonction and 
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disease, including any of the liver diseases abeady mentioned. For instance, blood s&naa 
levels of LAT-1 (as well as othra: bodily fluids) can be used as an indicator of livo: disease, 
especially those diseases diaractedzed by necrotic and degraierative lesions, such as hepatitis, 
toxicity, and dnhosis. Any condition which results in degeneration of the liver can result in 
5 the appearance of higher than normal amoimts of blood serum LAT-1 . LAT-1 can be used 
alone, or in combination with other molecular markers for liver function, such as bilirubin, 
serum aminotransferases (e.g., AST and ALT), alkaline phosphatase, gamma- 
glutamyltranspeptidase (GGT), albumin, globulin, and blood aimnonia. 

Because of the selectivity of LAT-1 for the liver, it is a useful target for botii 
10 histological and therapeutic applications. Antibodies and other LAT-1 binding partners can 
be used to selectively targ^ agents to liver tissue for any purpose, included, but not limited 
to, imaging, therapeutic, diagnostic, drug delivery, gene ther^y, etc. For example, LAT-1 
binding partners, such as antibodies, can be used to treat liver cardnoma, in analogy to how 
c-erbB-2 antibodies are \ised to breast cancer, to detect metastatic liver cells, etc. Useful 
1 5 antibodies or other binding partners include those that are specific for parts of LAT-1 which 
are exposed extracellularly, e.g. , amino acids 1-23, 82-100, 167-202, etc. 

Imaging of specific organs can be facilitated using agents, such as LAT- 1 , that can be 
used to selectively target contrast agents to a specific site in the body. Various imaging 
techniques have been used in tiiis context, including, e.g., X-ray, CT, CAT, MRI, ultrasound, 
20 PET, SPECT, and sdntographic. A reporta: agent can be conjugated or associated routinely 
with a LAT-1 binding partner. Ultrasound contrast agaits combined with binding partners, 
such as antibodies, are described in, e.g., U.S. Pat. Nos, 6,264,917, 6,254,852, 6,245,318, and 
6,139,819. MRI contrast agents, such as metal chelators, radionucleotides, paramagnetic 
ions, etc., combined with selective targeting agents are also described in tiie literature, e.g., in 
25 U.S. Pat. Nos. 6,280,706 and 6,22 1 ,334. The metiiods described therein can be used 
generally to associate a LAT-1 binding partner with an agent for any desired purpose. 

LAT-1 binding partners can also be used as to specifically deliver tiierapeutic agents 
to the liver. For example, hyperdholestarolCTiia and other metabolic diseases can be treated 
by gene therapy, using the LAT-1 to specifically deliver tiie LDL receptor to the liver. The 
30 gene can be conjugated to a LAT-1 binding partner (direcfly or through a polymer, etc.), in 
liposomes comprismg cell sur&ce. Additionally, cfytotoxic, cytostatic, and other therapeutic 
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agents can be delivered to the liver via LAT-1 to treat and/or prevent any of the above- 
mentioned conditions associated with liver disease, e,g., carcinoma. 

The liver is the largest and most metabolically complex organ in the body. Its 
functions include, e.g., storage of iron, production of bile to fedlitate digestion, 

5 detoxifications of various exogenous chemicals, including alcohol and many drugs, energy 
stoclq)iling (carbohydrates and fat), production of clotting factors, and manufacture of blood. 
There are a number of diseases which affect the liver, including, Alagille syndrome, 
alcoholic liver disease, alpha- 1 -antitrypsin deficiency, autoimmune hepatitis, Budd-Chiari 
syndrome, biliary atresia, Byler disease, liver cancer, Caroli disease, cirrhosis, Crigler-Najjar 

10 syndrome, Dubin- Johnson syndrome, fetty liver, galactosemia, Gilbert syndrome, glycog^ 
storage disease, hemangioma, hemochromatosis, hepatitis A-G, porphyria cutanea tarda, 
primary biliary cirrhosis, protoporphyria, eryfhrohepatic. Rotor syndrome, sclerosiag 
cholangitis, and Wilson disease. Liver disease is of grave concern around flie world. 
The liver is divided into many small units, known as lobules. The lobule is the 

15 structural unit of the liver. Each lobule is comprised of radial plates of liver cells, called 
hepatocytes, and is surrounded by a connective sheath. A central vein ("CV") is located in 
the middle, and there are portal triads at the vertices. Each triad comprises a branch of the 
hepatic artery (supplying arterial blood to the lobule), a branch of the hepatic portal vein 
(carrying nutrient-rich blood firom the digestive viscera), and a bile duct. The blood fi:om the 

20 artery and portal vein flow into leaky capillaries, the Uvct sinusoids, which are located 
between the hepatic plates of the lobule. 

The acinus is the functional unit of the liver. While the boundaries of the lobule are 
well visible, those of the acinus are unrecognizable under the microscope. Arising like a 
berry, a grape (latin "acinus") on the vine around the portal triad, fhc liver acinus is formed of 

25 a mass of liver cells and sinusoids which drain toward two adjacent central veins. The 
principal metabolic functions of the liver are performed by hepatocytes. These functions 
include, e.g., formation and exaretion of bile, regulation of carbohydrate homeostasis, lipid 
synthesis and secretion of plasma lipoprotefais, regulation of cholestorol metabolism, 
formation of urea, serum albumin, clottmg &ctors, enzymes, and numerous other proteins; 

30 and metabolism or detoxification of drags and other foreign substances. Hepatocytes in 

different regions of the acinus perform different functions, e.g., ^uooneogenesis is primarily 
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a function of the zone of cells closest to the triad, whereas glycolysis mainly occurs in the 

fiirthest zone fix>m it. 

A promoter obtained from the LAT-1 can be used, e.g., in gene therapy to obtain 
tissue-specific expression of a heterologous gme (e.g., coding for a therapeutic product or 

5 cytotoxin). A promoter sequence is found at about nucleotide positions 1 164-1212 of SEQ 
ID NO 58 and can be used (e.g., 1 164 to the first ATG codon) to drive Uver-spedfic 
expression of a heterologous sequence. 5' and 3' sequences (including, UTRs and introns) 
can be used to modulate or regulate stability, transcription, and translation of nucleic acids, 
including the sequence to which is attached in nature, as well as heterologous nucleic acids. 

10 A polyadenyiation site is found at about nucleotide positions 4265-4275 of SEQ ID NO 58. 
The upstream 3*UTR can be used as described above. Usefid polypeptides include 
polypeptides exposed extracellularly, e.g. , amino acids 1-23, 82-100, 167-202, of SEQ ID 
NO 59, etc. 

The present invention also relates to methods of detecting human liver tissue in a 

15 sample, e.g., comprising tissue, cells, or other cellular materials or debris, comprising one or 
more of the following steps, e.g., contactmg said sample with a binding partner specific for 
human LAT-1 imder conditions effective for said binding partner to bind specifically to 
human LAT-1, and detecting specific binding between said binding partner and said human 
LAT-1, whereby specific binding indicates that liver tissue is present in said sample. 

20 The sample can be contacted with the binding partner in any n^nner which is 

effective to give the binding partner access to the material present in the tissue sample. How 
contact is achieved can depend on the format of the detection assay. For instance, if a ELISA 
assay is used, and the binding partner is an antibody on a solid phase in a well, flien placing 
an aqueous sample in the well would achieve contact between partner and sample. Any type 

25 of sample can be used, including, e.g., blood (whole blood, firactionated blood, serum, etc.), 
stool, urine, cerebral spinal fluid, tissue biopsy, etc. 

The binding partner, such as a monoclonal or polyclonal antibody, is specific for 
LAT-1, and is contacted with the sample under conditions effective for said binding partner 
to bind specifically to human LAT-1, if human LAT-1 is present in the sample. Specific 

30 binding, as previously discussed for polynucleotides, indicates that the binding partner binds 
or attaches to its target polypeptide without significant binding to other polypeptides C*non- 
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specific binding") mfli© sample. This concept is well known in the art The detection of 
specific binding can be accomplished by any of flie aforementioned assays. 

The present invention also relates to polypeptide detection methods for assessing liva: 
function, e.g., methods of assessing liver function, comprising, detecting LAT-1 polypeptide, 

5 or fragments thereof, in a body fluid, whereby tiie level of LAT-1 polypeptide in said fluid is 
a measure of liver function. liver function tests are usually performed to determine whetliCT 
the liver is functioning normally as a way of diagnosing liver disease. Various tests are 
commonly used, including, e.g,, alkaline phosphatase, alanine transferase, aspartate 
transferase, bilirubin, gamma-glutamyl transpeptidase, lactic dehydrogenase, 5 '-nucleotidase, 

10 albumin, alpha-fetoprotein, mitochondrial antibodies, and prothrombin time. See, e.g., 

Harrison's Principles of Internal Medicine^Yol^ 1308-1317, 12^ Edition, 1991. 

Detection of LAT-1 provides an additional assessment tool, especially in diseases such as 
hepatitis, carcinoma, liver toxicity, cirrhosis, and other liver conditions, e.g., where cellular 
debris, etc., is released systemically. As with the other tests, elevated levels of LAT-1 in 

1 5 blood, or other fluids, can indicate impaired liver function. Values can be determined 
routinely, as they are for other liver function markers. 



Nucleic acids 

A maimnalian polynucleotide, or fragment fhereoj^ of the present invention is a 
20 polynucleotide having a nucleotide sequence obtainable from a natural source. When the 
species name is used, e.g., a human, it indicates that the polynucleotide or polypeptide is 
obtainable from a natural source. It therefore includes naturally-occurring normal, naturally- 
occurring mutant, and naturally-occurring polymorphic alleles (e.g., SNPs), differaitially- 
sphced transcripts, splice- variants, etc. By the term '*naturally-occurring," it is meant that the 
25 polynucleotide is obtainable from a natural source, e.g., animal tissue and cells, body fluids, 
tissue culture cells, forensic samples. Natural sources include, e.g., living cells obtained from 
tissues and whole organisms, tumors, cultured cell lines, including primary and immortalized 
cell lines. Naturally-occurring mutations can include deletions (e.g., a truncated amino- or 
carboxy-terminus), substitutions, inversions, or additions of nucleotide sequence. These 
30 genes can be detected and isolated by polynucleotide hybridization according to methods 
which one skilled in the art would know, e.g., as discussed below. 
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A polynucleotide accoiding to flie presaat inventioii can be obtained from a variety of 
different sources. It can be obtained ftom DNA or RNA, such as polyadenylated mRNA or 
total RNA, e.g., isolated fiom tissues, cells, or whole organism. The polynucleotide can be 
obtained directly fiom DNA or RNA, ftom a cDNA library, fiom a genomic library, etc. The 

5 polynucleotide can be obtained fix)m a cell or tissue (e.g., fiom an embryonic or adult tissues) 
at a particular stage of development, having a desired genotype, phraiotype, disease status, 
etc. A polynucleotide which "codes without interruption" refers to a polynucleotide having a 
continuous open reading fi^e ("ORF") as compared to an ORF which is interrupted by 
introns or oflier noncoding sequences. 

10 Polynucleotides and polypeptides (indudmg any part of a differaitially regulated 

cancer gene) can be excluded as compositions fiom the presoat invention i^ e.g., listed in a 
publicly available databases on the day this application was filed and/or disclosed in a patent 
application having an earlier filing or priority date than this plication and/or conceived 
and/or reduced to practice earlier than a polynucleotide in this application. 

15 As described herein, the phrase "an isolated polynucleotide which is SEQ ID NO," or 

"an isolated polynucleotide which is selected from SEQ ID NO," refers to an isolated nucleic 
add molecule from which the recited sequence was derived (e.g., a cDNA derived from 
mRNA; cDNA derived from genomic DNA). Because of sequencing errors, typographical 
errors, etc., the actual naturally-occurring sequence may differ from a SEQ ID listed herein. 

20 Thus, the phrase mdicates the spedfic molecule fiom which the sequence was derived, ratho: 
than a molecule having that exact redted nucleotide sequence, analogously to how a culture 
depository nurnber refers to a spedfic cloned fiagment in a acyotube. 

As explained in more detail below, a polynucleotide sequence of the invention can 
contain the complete sequence as shown in the corresponding SEQ ID, degenerate sequences 

25 thereof, anti-sense, muteins thereof, genes comprising said sequences, full-length cDNAs 
comprising said sequences, complete genomic sequences, fragments thereof, homologs, 
primers, nucleic add molecules whidi hybridize thereto, derivatives thereof, etc. 



30 



Genomic 

The present invention also relates genomic DNA from whidi the polynucleotides of 
the present invention can be derived. A genomic DNA coding for a human, mouse, or other 
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tnarmnaTian polynucleotide, Can be obtained routinely, for example, by screening a graiondc 
library (e.g., a YAC library) with a polynucleotide of the present invention, or by searching 
nucleotide databases, such as GenBank and EMBI^ for matches. Promoter and other 
regulatory regions (including both 5* and 3' regions, as well introns) can be identified 

5 upstream or downstream of coding and expressed KNAs, and assayed routinely for activity, 
e.g., by joining to a reporter gene (e.g., CAT, GFP, alkaline phosphatase, lucifeiase, 
galatosidase). A promoter obtained from a gene can be xised, e.g., in g«ie therapy to obtain 
tissue-specific expression of a hetraologous gene (e.g., coding for a therapeutic product or 
cytotoxin). 5' and 3' sequences (including, UTRs and introns) can be used to modulate or 

1 0 regulate stability, transcription, and translation of nucleic adds, including the sequence to 
whidi is attadied in nature, as well as heterologous nucleic acids. 

Constructs 

A polynucleotide of the present invention can comprise additional polynucleotide 

1 5 sequences, e.g., sequences to enhance expression, detection, uptake, cataloging, tagging, etc. 
A polynucleotide can include only coding sequence; a coding sequence and additional non- 
naturally occurring or heterologous coding sequence (e.g., sequences coding for leader, 
signal, secr^ry, targeting, oizymatic, fluorescent, antibiotic resistance, and other functional 
or diagnostic peptides); coding sequences and non-coding sequences, e.g., untranslated 

20 sequences at either a 5' or 3 ' end, or dispersed in the coding sequence, e.g., introns. 

A polynucleotide according to the present invention also can comprise an expression 
control sequence opeiably linked to a polynucleotide as described above. The phrase 
"expression control sequence* means a polynucleotide sequence that regulates e}q[»:ession of 
a polypeptide coded for by a polynucleotide to which it is fanctionally ("operably") linked. 

25 Expression can be regulated at the level of the mRNA or polypeptide. Thus, tiie expression 
control sequence includes mRNA-related elements and protein-related elements. Such 
elements include promoters, enhancers (viral or cellular), ribosome binding sequences, 
transcriptional terminators, etc. An exjwession control sequence is operably linked to a 
nucleotide coding sequence when the expression control sequence is positioned in such a 

30 manner to effect or achieve etpressionofthe coding sequence. For example, when a 

promoter is operably linked 5' to a coding sequence, expression of flie coding sequence is 
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driven by the promoter. Expression control sequences can include an initiation codon and 
additional nucleotides to place a partial nucleotide sequence of the present invention in-fiame 
in order to produce a polypeptide (e.g., pET vectors fiom Promega have been designed to 
permit a molecule to be inserted into all three reading frames to identify the one that results 
5 in polypeptide expression). Expression control sequences can be heterologous or endogenous 
to the normal gene. 

A polynucleotide of the present invention can also comprise nucleic acid vector 
sequences, e.g., for cloning, expression, amplification, selection, etc. Any effective vector 
can be used. A vector is, e.g., a polynucleotide molecule which can repUcate autonomously 

10 in a host cell, e.g., containing an origm of replication. Vectors can be useful to perform 

maniptdations, to propagate, and/or obtain large quantities of the recombmant molecule in a 
desned host. A skilled worker can select a vector depending on the purpose desired, e.g., to 
propagate tiie recombinant molecule in bacteria, yeast, insect, or mammalian cells. The 
following vectors are provided by way of example. Bactraial: pQE70, pQE60, pQE-9 

15 (Qiagen), pBS, pDlO, Phagescript, phiX174, pBK Phagemid, pNH8A, pNH16a, pNHlSZ, 
pNH46A (Stratagene); Bluescript KS+H (Stratagene); pb:c99a, pKK223-3, pKK233-3, 
pDR54 0, pRITS (Pharmacia). Eukaryotic: PWLNEO, pSV2CAT, pOG44, pXTl, pSG 
(Stratagene), pSVKS, PBPV, PMSG, pSVL (Pharmacia), pCR2.1/TOPO, pCRD/TOPO, 
pCR4/TOPO, pTrcHisB, pCMV6-XL4, etc. However, any other vector, e.g., plasmids, 

20 viruses, or parts thereof, may be used as long as they are replicable and viable in the desired 
host. The vector can also comprise sequences which enable it to replicate in the host whose 
genome is to be modified. 

Hybridization 

25 Polynucleotide hybridization, as discussed in more detail below, is useful in a variety 

of appUcations, mcluding, in gene detection methods, for identifying mutations, for making 
mutations, to identify homologs in the same and diffCTent species, to identify related 
members of the same gene femily, in diagnostic and prognostic assays, in fherq)eutic 
applications (e.g., where an antisense polynucleotide is used to inhibit e>qpression), etc. 

30 The ability of two single-stranded polynucleotide preparations to hybridize together is 

a measure of their nucleotide sequence cmnplementarity, e.g., base-pairing between 
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nucleotides, sudi as A-T, G-C, etc The invention thus also relates to ^lynucleotides, and 
their complements, which hybridize to a polynucleotide comprising a nucleotide sequence as 
set forth in the sequences disclosed herein, and ^omic sequences thereof A nucleotide 
sequence hybridizing to the latter sequence will have a complementary polynucleotide strand, 
5 or act as a template for one in the presence of a polymerase (i.e., an appropriate 

polynucleotide synthesizing enzyme). The present invention includes both strands of 
polynucleotide, e.g., a sense strand and an anti-sense strand. 

Hybridization conditions can be diosen to select polynucleotides which have a 
desired amount of nucleotide complementarity with the nucleotide sequences set forth in 
10 SEQ ID NOS 1, 1 1, 16, 25, 38, 43, 45, 47, 49, 51, and/or 58and genomic sequences thereof 
A polynucleotide capable of hybridizing to such sequence, preferably, possesses, e.g., about 
70%, 75%, 80%, 85%, 87%, 90%, 92%, 95%, 97%, 99%, or 100% complementarity, 
between the sequences. The preset invention particularly relates to polynucleotide 
sequences which hybridize to the nucleotide sequences set forth in the sequence disclosure 
1 5 herein or genomic sequences thereof, under low or high stringency conditions. These 
conditions can be used, e.g., to select corresponding homologs in non-human species. 

Polynucleotides which hybridize to polynucleotides of the presrait invention can be 
selected in various ways. Filter-type blots (i.e., matrices containing polynucleotide, such as 
nitrocellulose), glass chips, and other matrices and substrates comprising polynucleotides 
20 (short or long) of interest, can be incubated in a prehybridization solution (e.g., 6X SSC, 
0.5% SDS, 100 ng/ml denatured sahnon sperm DNA, 5X Denhardt's solution, and 50% 
foimamide), at 22-68°C, overnight, and then hybridized with a detectable polynucleotide 
probe under conditions appropriate to achieve the desked stringency. In general, when hi^ 
homology or sequence identity is desired, a high temperature can be used (e.g., 65 °C). As 
25 the homology drops, lower washing temperatures are used. For salt concentrations, the lower 
the salt concentration, tiie higher the stiingency. The lengtii of tiie probe is another 
consideration. Very short probes (e.g., less than 100 base pairs) are washed at lower 
temperatures, even if the homology is high- With short probes, formamide can be omitted. 
See, e.g.. Current Protocols in Molecular Biology, Ch^ter 6, Screemng of Recombinant 
30 Libraries; Sambrook et al.. Molecular Clonings 1989, Chapter 9. 
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For instance, high stringency conditions can be achieved by incubating the blot 
ovonight (e.g., at least 12 hours) with a polynl^lcleotide probe in a hybridization solution 
containing, e.g., about 5X SSC, 0.5% SDS, 100 \ig/ml denatured sahnon spenn DNA and 
50% foimamide, at 42°C, or hybridizing at 42°C in 5X SSPE, 0.5% SDS, and 50% 
5 formamide, 100 fig/ml denatured salmon sperm DNA, and washing at 65°C in 0. IX SSC and 
0.1% SDS. 

Blots can be washed at high stringency conditions that allow, e.g., for less than 5% 
bp mismatch (e.g., wash twice in 0.1% SSC and 0.1% SDS for 30 min at 65°C), i.e., 
selecting sequences having 95% or greater sequence identity. 

10 Other non-limiting examples of high stringaicy conditions includes a final wash at 

65''C in aqueous buffer containing 30 mM NaCl and 0.5% SDS. Another example of higji 
stringent conditions is hybridization in 7% SDS, 0.5 M NaP04, pH 7, 1 mM EDTA at 50''C. 
e.g., overnight, followed by one or more washes with a 1% SDS solution at 42**C. 

Whereas high stringency washes can allow for, e.g., less than 10%, less than 5% 

1 5 mismatch, etc., reduced or low stringency conditions can permit up to 20% nucleotide 

mismatch. Hybridization at low stringency can be accomplished as above, but using lower 
foimamide conditions, Iowa: temperatures and/or lower salt concentrations, as well as longer 
penods of incubation time. 

Hybridization can also be based on a calculation of melting temperature (Tm) of the 

20 hybrid formed between the probe and its target, as described in Sambrook et al.. Generally, 
the temperature Tm at which a short oligonucleotide (containing 18 nucleotides or fewer) 
will melt from its target sequaice is given by the following equation: Tm = (number of A's 
and T's)x2*'C + (number of C's and G's)x4*'C. For longer molecules, Tm = 81.5+ 16.6 
logio[Na*] + 0.41(%GC) - 600/N where [Na^ is the molar concentration of sodium ions, 

25 %GC is the percentage of GC base pairs in the probe, and N is the length. Hybridization can 
be carried out at several degrees below this temperature to ensure that the probe and target 
can hybridize. Mismatches can be allowed for by lowering the temperature even further. 

Stringent conditions can be selected to isolate sequences, and their complements, 
which have, e.g., at least about 90%, 95%, or 97%, nucleotide complementarity between the 

30 and a target polynucleotide. 
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Other homologs of polynucleotides of the present invention can be obtained firom 
mammalian and non-mammalian sources according to various methods. For example, 
hybridization with a polynucleotide can be employed to select homologs, e.g., as described in 
Sambrook et al.. Molecular Cloning, Chapter 1 1, 1989. Such homologs can have varying 

5 amounts of nucleotide and amino acid sequence identity and similarity to such 

polynucleotides of the present invention. Mammalian organisms include, e.g., mice, rats, 
monkeys, pigs, cows, etc. Non-mammalian organisms include, e.g., vertebrates, 
invertebrates, zebra fish, chicken, Drosophila, C. elegans, Xenopus, yeast such as S. pombe, 
S. cerevisiae, roundworms, prokaryotes, plants, Arabidopsis, artemia, viruses, etc. The 

10 degree of nucleotide sequmce identity between human and mouse can be about, e.g. 70% or 
more, 85% or more for open reading framed, etc. 

Aligmnent 

Alignments can be accomplished by using any effective algorithm. For pairwise 

1 5 alignments of DNA sequences, the methods described by Wilbur-Lipman (e.g., Wilbur and 
Lipman, Proc. Natl Acad. ScL, 80:726-730, 1983) or Martinez/Needleman-Wunsch (e.g., 
Marthxoz, Nucleic Acid Res., 11:4629-4634, 1983) can be used. For instance, if the 
Martin«!yNeedleman-Wunsch DNA alignment is applied, the minimum match can be set at 
9, gap penalty at 1.10, and gap length penalty at 0.33. The results can be calculated as a 

20 similarity index, equal to the sum of the matching residues divided by the sum of all residues 
and gap characters, and then multiplied by 100 to express as a percent. Similarity index for 
related gen&s at the nucleotide level in accordance with the present invention can be greater 
than 70%, 80%, 85%, 90%, 95%, 99%, or more. Pairs of protein sequences can be aligned 
by the Lipman-Pearson method (e.g., Lipman and Pearson, Science, 227:1435-1441, 1985) 

25 with k-tuple set at 2, gap penalty set at 4, and gap length penalty set at 12. Results can be 
expressed as percent similarity index, where related genes at the amino acid level in 
accordance with the present invention can be greater than 65%, 70%, 75%, 80%, 85%, 90%, 
95%, 99%, or more. Various commercial and firee sources of alignment programs are 
available, e.g., MegAlign by DNA Star, BLAST (National Center for Biotechnology 

30 Mbrmation), BCM (Baylor College of Medicine) Launcher, etc. BLAST can be used to 
calculate amino acid sequence identity, amino acid sequence homology, and nucleotide 




wo 03/085095 PCTAJS03/09921 

-38- 

sequence identity. These calculations can be made along the entire length of each of the 
target sequences 'which are to be compared. 

After two sequences have beoi aligned, a *'pCTC«it sequence identity** can be 
determined. For tiiese purposes, it is convenient to rrfer to a Reference Sequence and a 

5 Compared Sequence, where the Compared Sequaice is conq>ared to the Reference Sequmce. 
Percent sequence identity can be determined according to tiie following formula: PCTcrait 
Identity= 100 [1-<C/R)], wherein C is the number of diflferaices between title Reference 
Sequence and the Con^ared Sequence over the length of alignmrait between the Reference 
Sequence and the Compared Sequence where (i) eadi base or amino acid in the Reference 

10 Sequence that does not have a corresiponding aligned base or amino add in the Compared 

Sequence, (ii) each in the Reference Sequence, (iii) each aligned base or amino add in flie 
Reference Sequence that is different fixmi an aligned base or ammo add in the Compared 
Sequence, constitutes a difference; and R is the number of bases or amino adds in the 
Reference Sequence over the length of the ahgnment with the Compared Sequence with any 

1 5 gap created in the Reference Sequence also being counted as a base or amino add. When it is 
stated that a polynucleotide sequence has a certain percentage ving 95% or more sequence 
identity along die entire length of the polynucleotide sequence set forth in SEQ ID NO 1,11, 
16, 25, 38, 43, 45, 47, 49, 51, or 58, it means that when the polynucleotide is shorter than the 
mentioned SEQ ID NOS, the missing bases are counted for the purposes of the calculation. 

20 Percent sequence identity can also be determined by other conventional methods, e.g., as 
described in Altsdml et al.. Bull. Math. Bio. 48: 603-616, 1986 and Haiikoff and Henikof^ 
Proc. Natl. Acad. Sci. USA 89:10915-10919, 1992. 

Spedfic polynucleotide probes 

25 A polynucleotide of the present invention can comprise any continuous nucleotide 

sequence of SEQ ID NOS 1,11, 16, 25, 38, 43, 45, 47, 49, 51, and/or 58, sequences which 
share sequence identity thereto, or complements thereof. The temi "probe" refers to any 
substance that can be used to detect identify, isolate, etc., another substance. A 
polynucleotide probe is comprised of nucldc add can be used to detect, identify, etc., other 

30 nucldc adds, such as DNA and RNA. 
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These polynucleotides can be of any desired size that is effective to achieve the 
specificity desired. For example, a probe can be from about 7 or 8 nucleotides to several 
thousand nucleotides, depending upon its use and purpose. For instance, a probe used as a 
primer PGR can be shorter than a probe used in an ordered array of polynucleotide probes. 

5 Probe sizes vary, and the invention is not limited in any way by their size, e.g., probes can be 
from about 7-2000 nucleotides, 7-1000, 8-700, 8^00, 8-500, 8-400, 8-300, 8-150, 8-100, 8- 
75, 7-50, 10-25, 14-16, at least about 8, at least about 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 
20, 21, 22, 23, 24, 25, 26, or more, etc. The polynucleotides can have non-naturally-occurring 
nucleotides, e.g., inosine, AZT, 3TC, etc. The polynucleotides can have 100% sequence 

10 identity or complementarity to a sequence of SEQ ID NOS 1, 11, 16,25,38,43,45,47,49, 
51, and/or 58, or it can have mismatches or nucleotide substitutions, e.g., 1, 2, 3, 4, or 5 
substitutions. The probes can be single-stranded or double-stranded. 

In accordance with the present invention, a polynucleotide can be present in a kit, 
where the kit includes, e.g., one or more polynucleotides, a desired buffer (e.g., phosphate, 

15 tris, etc.), detection compositions, RNA or cDNA from different tissues to be xised as 

controls, libraries, etc. The polynucleotide can be labeled or unlabeled, with radioactive or 
non-radioactive labels as known in the art. Kits can comprise one or more pairs of 
polynucleotides for amplifying nucleic acids specific for polynucleotides, e.g., comprising a 
forward and reverse primer effective in PGR. These include both sense and anti-sense 

20 orientations. For instance, in PCR-based methods (such as RT-PCR), a pair of primers are 
typically used, one having a sense sequence and the other haAong an antisense sequence. 

Another aspect of the present invention is a nucleotide sequence that is specific to, or 
for, a selective polynucleotide. The phrases "specific for" or "specific to" a polynucleotide 
have a functional meaning that the polynucleotide can be used to identify the presence of one 

25 or more target genes in a sample and distinguish them from non-target genes. It is specific in 
the sense that it can be used to detect polynucleotides above background noise C'non-specific 
binding**)- A specific sequence is a defined order of nucleotides (or amino acid sequences, if 
it is a polypeptide sequence) which occurs in the polynucleotide, e.g., in the nucleotide 
sequences of SEQ ID NOS 1, 11, 16, 25, 38, 43, 45, 47, 49, 51, and/or 58, and which is 

30 characteristic of that target sequence, and substantially no non-taiget sequratces. A probe or 
n[iixture of probes can comprise a sequence or sequences that are specific to a plurality of 
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target sequences, e.g., where the sequence is a consmsus sequence, a functional domain, etc., 
e.g-, enable of recognizing a femily of related genes. Such sequences can be used as probes 
in any of the methods described herein or incorporated by referrace. Bofli sense and 
antisense nucleotide sequences are included. A specific polynucleotide according to the 

5 present invention can be determined routinely. 

A polynucleotide comprising a specific sequence can be used as a hybridization probe 
to identify the presence of, e.g., human or mouse polynucleotide, in a sample comprising a 
mixture of polynucleotides, e.g., on a Northern blot. Hybridization can be performed under 
high stringent conditions (see, above) to select polynucleotides (and their complements which 

10 can contain the coding sequence) having at least 90%, 95%, 99%, etc., identity (i.e., 

complementarity) to the probe, but less stringent conditions can also be used. A specific 
polynucleotide sequence can also be fused in-firame, at either its 5' or 3' end, to various 
nucleotide sequences as mentioned throughout the patent, including coding sequences for 
enzymes, detectable markers, GFP, etc, expression control sequences, etc. 

15 A polynucleotide probe, especially one that is specific to a polynucleotide of the 

present invention, can be used in gene detection and hybridization methods as already 
described. In one embodiment, a specific polynucleotide probe can be used to detect 
whether a particular tissue or cell-type is present in a target sample, e.g., with OTB 1 82, 
OTB860, or LAT-1 . To carry out such a method, a selective polynucleotide can be chosen 

20 which is characteristic of the desired target tissue. Such polynucleotide is preferably chosen 
so that it is expressed or displayed in the target tissue, but not in other tissues which are 
present in the sample. For instance, if detection of brain tissue is desired, it may not matter 
whether the selective polynucleotide is expressed in other tissues, as long as it is not 
expressed in cells normally present in blood, e.g., peripheral blood mononuclear cells. 

25 Starting fi-om the selective polynucleotide, a specific polynucleotide probe can be designed 
which hybridizes (if hybridization is the basis of flie assay) imder the hybridization conditions 
to the selective polynucleotide, whereby the presence of the selective polynucleotide can be 
determined. 

Probes which are specific for polynucleotides of the present invention can also be 
30 prepared using involve transcription-based systems, e.g., incorporating an RNA polymerase 
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piomoter into a selective polynucleotide of the preset invention, and Hbesa transcribing anti- 
sense RNA using the polynucleotide as a template. See, e.g., U.S. Pat. No. 5,545,522. 

Polynucleotide composition 

5 A polynucleotide according to the present invention can comprise, e.g., DNA, RNA, 

synthetic polynucleotide, peptide polynucleotide, modified nucleotides, dsDNA, ssDNA, 
ssRNA, dsRNA, and mixhires tiiereof. A polynucleotide can be single- or double-stimded, 
triplex, DNA:RNA, duplexes, comprise hairpins, and other secondary shructures, etc. 
Nucleotides comprising a polynucleotide can be joined via various known linkages, e.g., 

10 ester, sulfitmate, sul&mide, phosphorothioate, phosphoramidate, methylphosphonate, 
carbamate, etc., depending on the desned purpose, e.g., resistance to nucleases, such as 
RNAse H, improved in vivo stabiUty, etc. See, e.g., U.S. Pat. No. 5,378,825. Any desired 
nucleotide or nucleotide analog can be incorporated, e.g., 6-mercaptoguanine, 8-oxo-guanine, 
etc. 

15 Various inodifications can be made to the polynucleotides, such as attaching 

detectable markers (avidin, biotin, radioactive elements, fluorescent tags and dyes, energy 
transfer labels, energy-emitting labels, binding partners, etc.) or moieties which improve 
hybridization, detection, and/or stabiUty. The polynucleotides can also be attached to soUd 
supports, e.g., nitrocellulose, magnetic or paramagnetic microspheres (e.g., as described in 

20 U.S. Pat. No. 5,41 1,863; U.S. Pat. No. 5,543,289; for instance, comprising ^rromagnetic, 
supermagnetic, paramagnetic, superparamagnetic, iron oxide and polysaccharide), njdon, 
agarose, diazotized cellulose, latex solid microspheres, polyacrylamides, etc., according to a 
desired method. See, e.g., U.S. Pat Nos. 5,470,967, 5,476,925, and 5,478,893. 

Polynucleotide according to the present invention can be labeled according to any 

25 desired method. The polynucleotide can be labeled using radioactive tracers such as ^^P, ^% 
^H, or ^''C, to mention some commonly used tracers. The radioactive labeling can be carried 
out according to any method, such as, for example, terminal labeling at the 3 ' or 5' end using 
a radiolabeled nucleotide, polyiracleotide kinase (with or without dephosphorylation with a 
phosphatase) or a ligase (dq)ending on the end to be labeled). A non-radioactive labeling can 

30 also be used, combining a polynucleotide of the present invention with residues having 
immunological properties (antigetis, h^teos), a specific afBnity for certain reagoits 
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(ligands), properties enabling detectable enzyme reactions to be completed (oizymes or 
coenzymes, enzyme substrates, or other substances involved in an enzymatic reaction), or 
characteristic physical jooperties, such as fluorescence or flie emission or absorption of li^t 
at a desired wavelength, etc. 

5 

Nucleic acid detection methods 

Another aspect of the present invention relates to methods and processes for detecting 
polynucleotides. Detection mefliods have a variety of ^plications, including for diagnostic, 
prognostic, foroisic, and research applications. To accomplish gene detection, a 

10 polynucleotide in accordance with the present invention can be used as a "probe." The term 
"piob^' or **polynucleotide probe" has its customary meaning in the art, e.g., a polynucleotide 
which is effective to identify (e.g., by hybridization), when used in an appropriate process, 
tihie presence of a target polynucleotide to which it is designed. Identification can involve 
simply determining presence or absence, or it can be quantitative, e.g., in ass^sing amounts 

15 of a gene or gene transcript present in a sample. Probes can be usefid in a variety of ways, 
such as for diagnostic purposes, to identify homologs, and to detect, quantitate, or isolate a 
polynucleotide of the present invention in a test sample. 

Assays can be utilized which permit quantification and/or presence/absence detection 
of a target nucleic acid in a sample. Assays can be performed at the single-cell level, or in a 

20 sample comprising many cells, where the assay is "averaging'* expression over the mtire 
collection of cells and tissue present in tiie sample. Any suitable assay format can be used, 
including, but not limited to, e.g.. Southern blot analysis. Northern blot analysis, polymerase 
chain reaction ('TCR") (e.g., Saild et al.. Science, 241:53, 1988; U.S. Pat Nos. 4,683,195, 
4,683,202, and 6,040,166; PCR Protocols: A Guide to Methods and Applications, hmis et al., 

25 eds.. Academic Press, New York, 1990), reverse transcriptase polymerase chain reaction 

("RT-PCR"), anchored PCR, rapid ampUfication of cDNA ends CRACE") (e.g., Schaefer in 
Gene Cloning and Analysis: Current Innovations, Pages 99-115, 1997), ligase chain reaction 
("LCR") (EP 320 308), one-sided PCR (Ohara et al., Proc. Natl. Acad. Sci., 86:5673-5677, 
1989), indexing methods (e.g., U.S. Pat No. 5,508,169), in situ hybridization, differential 

30 display (e.g., Uang et al., Nucl. Acid. Res., 21:3269-3275, 1993; U.S. Pat Nos. 5,262,31 1, 
5,599,672 and 5,965,409; W097/18454; Prashar and Weissman, Proc. Natl. Acad. Sci., 
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93:659-663, and U.S. Pat Nos. 6,010,850 and 5,712,126; Welsh et al.. Nucleic Acid Res., 
20:4965-4970, 1992, and U.S. Pat No. 5,487,985) and other KNA fingetprinting techniques, 
nucleic add sequence based amplification C*NASBA") and other transraiption based 
ampUfication systems (e.g., U.S. Pat. Nos. 5,409,818 and 5,554,527; WO 88/10315), 
5 polynucleotide airays (e.g., U.S. Pat Nos. 5,143,854, 5,424,186; 5,700,637, 5,874,219, and 
6,054,270; PCT WO 92/10092; PCT WO 90/15070), Qbeta RepUcase (PCT/US87/00880), 
Strand Displacement Amplification ("SDA"), Repair Chain Reaction ("RCR"), nuclease 
protection assays, subtraction-based metibods, Rapid-Scan™, etc. Additional useful methods 
include, but are not limited to, e.g., tenq)late-based amplification methods, competitive PGR 
. 10 (e.g., U.S. Pat No. 5,747,251), redox-based assays (e.g., U.S. Pat No. 5,871,918), Taqman- 
based assays (e.g., Holland et al., Proc. Natl. Acad, Set., 88:7276-7280, 1991; U.S. Pat Nos. 
5,210,015 and 5,994,063), real-timfe fluorescence-based monitoring (e.g., U.S. Pat 
5,928,907), molecular energy transfer labels (e.g., U.S. Pat Nos. 5,348,853, 5,532,129, 
5,565,322, 6,030,787, and 6,1 17,635; Tyagi and Kramer, Nature Biotech., 14:303-309, 
15 1996). Any method suitable for single cell analysis of gene or protein expression can be 

used, including in situ hybridization, immunocytochemistry, MACS, FACS, flow cytometry. 

For single cell assays, expression products can be measured using antibodies, PCR, or 
other types of nucleic acid amplification (e.g., Brady et al.. Methods Mol. & Cell. Biol. 2, 17- 
25, 1990; Eberwine et al., 1992, Proc. Natl. Acad Set., 89, 3010-3014, 1992; U.S. Pat No. 
20 5,723,290). These and otiier methods can be carried out conventionally, e.g., as described in 
the mentioned publications. 

Many of such methods may require that the polynucleotide is labeled, or comprises a 
particular nucleotide type usefiil for detection. The present invention includes such modified 
polynucleotides that are necessary to carry out such methods. Thus, polynucleotides can be 
25 DNA, RNA, DNA:RNA hybrids, PNA, etc., and can comprise any modification or 
substituent which is effective to adiieve detection. 

Detection can be deshable for a variety of differait purposes, including research, 
diagnostic, prognostic, and forensic. For diagnostic purposes, it may be deshable to identify 
the presence or quantity of a polynucleotide sequence in a sample, where the sample is 
30 obtained fiom tissue, cells, bo<fy fluids, etc. In a preferred method as described in more 
detail below, the present invention relates to a method of detecting a polynucleotide 
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comprising, contacting a target polynucleotide in a test sample with a polynucleotide probe 
under conditions effective to adbdeve hybridization between the target and probe; and 
detecting hybridization. 

Any test sample in which it is desired to identify a polynucleotide or polypeptide 
5 thereof can be used, including, e,g., blood, urine, saliva, stool (for extracting nucleic add, 
see, e.g., U.S. Pat No. 6,177,251), swabs comprising tissue, biopsied tissue, tissue sections, 
cultured cells, etc. 

Detection can be accomplished in combination with polynucleotide probes for other 
genes, e.g., gmes which are expressed in other disease states, tissues, cells, such as brain, 
10 heart, kidney, spleen, thymus, liver, stomach, small intestine, colon, muscle, lung, testis, 
placenta, pituitary, thyroid, skin, adrenal gland, pancreas, salivary gland, uterus, ovary, 
prostate gland, peripheral blood cells (T-cells, lymphocytes, etc.), embryo, breast, fat, adult 
and embryonic stem cells, specific cell-types, such as endodielial, epithelial, myocjrtes, 
adipose, etc. 

15 Polynucleotides can be used in wide range of methods and compositions, including 

for detecting, diagnosing, staging, grading, assessing, prognosticating, etc. diseases and 
disorders associated with polynucleotides, for monitoring or assessing therapeutic and/or 
preventative measures, in ordered arrays, etc. Any method of detecting genes and 
polynucleotides can be used; certainly, the present invention is not to be limited how such 

20 methods are implemented. 

Along these lines, the present invention relates to methods of detecting 
polynucleotides in a sample comprising nucldc add. Such methods can comprise one or 
more the following steps in any effective order, e.g., contacting said sample with a 
polynucleotide probe under conditions effective for said probe to hybridize specifically to 

25 nucleic acid in said sample, and detecting the presence or absence of probe hybridized to 

nucleic acid in said sample, wherein said probe is a polynucleotide which is SEQ ID NOS 1, 
11, 16, 25, 38, 43, 45, 47, 49, 51, and/or 58, a polynucleotide having, e.g., about 70%, 80%, 
85%, 90%, 95%, 99%, or more sequence identity thereto, effective or specific fi:agments 
thereof, or complements thereto. The detection method can be applied to any sample, e.g., 

30 cultured primary, secondary, or established cell lines, tissue biopsy, blood, urine, stool, 
cerebral spinal fluid, and other bodily fluids, for any purpose. 
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Ck>ntactiiig the sample wifli probe can be carried out by any effective means in any 
effective environment ft can be accomplished in a soUd, Uqtdd, fiozai, gaseous, amorphous, 
soUdified, coagulated, colloid, etc., mixtures thereof matrix. For instance, a probe in an 
aqueous medium can be contacted with a sample which is also in an aqueous medium, or 

5 which is afGxed to a solid matrix, or vice-versa. 

Generally, as used throughout the specification, the term "effective conditions" 
means, e.g., the particular milieu in which the desired effect is achieved. Such a milieu, 
includes, e.g., appropriate buffers, oxidizing agents, reducing agents, pH, co-fectors, 
temperature, ion concentrations, suitable age and/or stage of cell (such as, in particular part of 

10 the cell qrcle, or at a particular stage where particular genes are being expressed) where cells 
are bdng used, culture conditions (inchiding substrate, oxygen, carbon dioxide, etc.). Whrai 
hybridization is the diosen means of achieving detection, the probe and sample can be 
combined such fliat the resulting conditions are functional for said probe to hybridize 
specifically to nucleic acid in said sample. 

1 5 The phrase "hybridize specifically" indicates that the hybridization between single- 

stranded polynucleotides is based on nucleotide sequence complementarity. The effective 
conditions are selected such that the probe hybridizes to a preselected and/or definite target 
nucleic add in tihe sample. For instance, if detection of a polynucleotide set forth in SEQ ID 
NOS 1, 1 1, 16, 25, 38, 43, 45, 47, 49, 51, and/or 58 is desired, a probe can be selected which 

20 can hybridize to such target gene under high stringent conditions, witiiout significant 

hybridization to other genes in the sample. To detect homologs of a polynucleotide set forfli 
in SEQ ID NOS 1, 1 1, 16, 25, 38, 43, 45, 47, 49, 51, and/or 58, the effective hybridization 
conditions can be less stringent, and/or the probe can comprise codon degeneracy, such that a 
homolog is detected in the sample. 

25 As ahready mentioned, the methods can be carried out by any effective process, e.g., 

by Northern blot analysis, polymerase chain reaction (PGR), reverse transcriptase PGR, 
RACE PGR, in situ hybridization, etc., as indicated above. When PGR based techniques are 
used, two or more probes are generally used. One probe can be specific for a defined 
sequence which is characteristic of a selective polynucleotide, but the other probe can be 

30 specific for the selective polynucleotide, or specific for a more general sequence, e.g., a 

sequence such as polyA which is characteristic of mRNA, a sequence which is specific for a 
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promoter, ribosome binding site, or o1h« transaiptional features, a consensus sequence (e.g., 
representing a functional domain). For the fonner aspects, 5' and 3' probes (e.g., polyA, 
Kozak, etc.) are prefeired which are capable of spedfically hybridizing to the ends of 
transcripts. When PCR is utilized, the probes can also be referred to as '*primers" in that they 

5 can prime a DNA polymerase reaction 

In addition to testing for the presence or absence of polynucleotides, the present 
invention also relates to determining the amounts at which polynucleotides of Ihe present 
invention are expressed in sample and determining the differential expression of such 
polynucleotides in samples. Such methods can involve substantially the same steps as 

10 described above for presence/absence detection, e.g., contacting with probe, hybridizing, and 
detecting hybridized probe, but using more quantitative mefliods and/or comparisons to 
standards. The amount of hybridization between the probe and target can be determined by 
any suitable methods, e.g., PCR, RT-PCR, RACE PCR, Northern blot, polynucleotide 
microarrays, Rapid-Scan, etc., and includes botii quantitative and qualitative measurements. 

15 

Methods of identifying polymorphisms, mutations, etc., of polynucleotides 

Polynucleotides of the present invention can also be utilized to identify mutant alleles, 
SNPs, gaie learran^ments and modifications, and other polymorphisms of the wild-type 
gene. Mutant alleles, polymorphisms, SNPs, etc., can be identified and isolated fiiom 

20 subjects with diseases that are known, or suspected to have, a genetic component 

Identification of sudi genes can be carried out routinely (see, above for more guidance), e.g., 
using PCR, hybridization techniques, direct sequaidng, mismatch reactions (see, e.g., 
above), RFLP analysis, SSCP (e.g., Orita et al., Proc. Natl Acad, Sci., 86:2766, 1992), etc., 
where a polynucleotide having a sequence selected firom SEQ ID NOS 1 , 1 1 , 16, 25, 38, 43, 

25 45, 47, 49, 5 1 , and/or 58 is used as a probe. The selected mutant alleles, SNPs, 

polymorphisms, etc., can be used diagnostically to determine whether a subject has, or is 
susceptible to a disorder associated with polynucleotides, as well as to design therapies and 
predict the outcome of the disorder. Methods involve, e.g., diagnosing a disorder associated 
with polynucleotides or determining susceptibility to a disorder, comprising, detecting flie 

30 presoice of a mutation in a gene represented by a polynucleotide selected fixjm SEQ ID NOS 
1, 11, 16,25,38,43,45,47,49, 51, and/or 58. The detecting can be carried out by any 
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efifective method, e.g., obtaining cells fiom a subject, detOTnining the gene sequence or 
structure of a target gene (using, e.g., mRNA, cDNA, genomic DNA, etc), comparing the 
sequence or structure of flie target gene to the structure of the nomial gene, whereby a 
difference in sequence or structure indicates a mutation in the gene in the subject. 
5 Polynucleotides can also be used to test for mutations, SNPs, polymorphisms, etc., e.g., using 
mismatch DNA repair technology as described in U.S. Pat. No. 5,683,877; U.S. Pat. No. 
5,656,430; Wu et al., Proc. Natl Acad. Set, 89:8779-8783, 1992. 

The present invention also relates to methods of detecting polymorphisms in genes of 
the present invention, comprising, e.g., comparing the structure of: genomic DNA, mRNA, 
10 cDNA, etc., with the structure of a polynucleotide set forfh in SEQ ID NOS 1, 1 1, 16, 25, 38, 
43, 45, 47, 49, 51, or 58. The methods can be carried out on a sample from any source, e.g., 
cells, tissues, body fluids, blood, urine, stool, hair, egg, spenn,cerebral spmal fluid, etc. 

These methods can be implemented in many different ways. For example, 
"comparing the structure" steps include, but are not limited to, comparing restriction m^s, 
15 nucleotide sequences, amino acid sequences, RFLPs, Dnase sites, DNA methylation 
fingerprints (e.g., U.S. Pat. No. 6,214,556), protein cleavage sites, molecular weights, 
electrophoretic mobiUties, charges, ion mobility, etc.. The term "structure" can refer to any 
physical characteristics or configurations which can be used to distinguish between nucleic 
acids and polypeptides. The methods and instruments used to accomplish flie comparing step 
20 depends upon the physical characteristics which are to be compared. Thus, various 

techniques are contemplated, including, e.g., sequendng machines (both amino acid and 
polynucleotide), electrophoresis, mass spectrometer (U.S. Pat. Nos. 6,093,541, 6,002,127), 
Uquid chromatography, HPLC, etc. 

To carry out such methods, "all or part" of the gene or polypeptide can be compared. 
25 For example, if nucleotide sequencing is utilized, the entire gene can be sequenced, including 
promoter, introns, and exons, or only parts of it can be sequenced and compared, e.g., exon 1, 
exon 2, etc. 

Mutagenesis 

30 Mutated polynucleotide sequences of the present invention are useful for various 

purposes, e.g., to create mutations of the polypeptides they encode, to identify functional 
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regions of genomic DNA, to produce probes for screening Ubraries, etc. Mutagenesis can be 
carried out routinely according to any effective method, e.g., oUgonucleotide-directed (Smith, 
M., Ann. Rev. Genet 19:423-463, 1985), degenerate oligonucleotide-directed (Hill et al.. 
Method Enzymology, 155:558-568, 1987), region-specific (Myers et al.. Science, 229:242- 
5 246, 1985; Derbyshire et al.. Gene, 46:145, 1986; Na: et al., DNA, 7:127, 1988), linker- 
scanning (McKnight and Kingsbury, 5c»e«ce, 217:316-324, 1982), directed using PGR, 
recursive ensemble mutagenesis (Arkin and Yourvan, Proa. Natl. Acad. Set, 89:781 1-7815, 
1992), random mutagenesis (e.g., U.S. Pat. Nos. 5,096,815; 5,198,346; and 5,223,409), site- 
directed mutagenesis (e.g., Walder et al.. Gene, 42:133, 1986; Bauer et al.. Gene, 37:73, 
10 1985; Craik, Bio Techniques, January 1985, 12-19; Smith et al.. Genetic Engineering: 

Principles and Methods, Plenum Press, 1981), phage display (e.g., Lowman et al., Biochem. 
30:10832-10837, 1991; Ladner et al., U.S. Pat. No. 5,223,409; Huse, WO PubUcation WO 
92/06204), etc. Desired sequences can also be produced by the assanbly of target sequences 
using mutually priming oligonucleotides (UWmann, Gene, 71:29-40, 1988). For directed 
1 5 mutagenesis methods, analysis of the three-dimensional structure of the polynucleotides 
polypeptide can be used to guide and facilitate making mutants which effect polypeptide 
activity. Sites of substrate-en2yme interaction or other biological activities can also be 
determined by analysis of crystal structure as determined by such techniques as nuclear 
magnetic resonance, crystallography or photoafifinity labeling. See, for example, de Vos et 
20 al.. Science 255:306-312, 1992; Smith et al., J. Mol. Biol. 224:899-904, 1992; Wlodaver et 
al., FEBS Lett. 309:59-64, 1992. 

hi addition, libraries of polynucleotides and fiagments thereof can be used for 
screening and selection of polynucleotides variants. For instance, a Ubrary of coding 
sequences can be generated by treating a double-stranded DNA with a nuclease under 
25 conditions where the nicking occurs, e.g., only once per molecule, denaturing the double- 
stranded DNA, renaturing it to for double-stranded DNA that can include sense/antisense 
pairs fiom different nicked products, removing single-stranded portions from reformed 
duplexes by treatment with SI nuclease, and Ugating the resulting DNAs into an expression 
vector. By this method, e^qnession libraries can be made comprising •*mutagenized" 
30 polynucleotides. The entire coding sequence or parts thereof can be used. 
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Polymicleotide e7cpressioD9 polypeptides produced thereby, and specific-binding partners 
thereto. 

A polynucleotide according to the present invention can be expressed in a variety of different 
systems, in vitro and in vivo, according to the desired purpose. For example, a 

5 polynucleotide can be inserted into an expression vector, introduced into a desired host, and 
cultured under conditions effective to achieve expression of a polypeptide coded for by the 
polynucleotide, to search for specific binding partners. Effective conditions include any 
culture conditions which are suitable for achieving production of the polypeptide by the host 
cell, including effective temperatures, pH, medium, additives to the media in which flie host 

10 cell is cultured (e.g., additives which amplify or induce expression such as butyrate, or 
methotrexate if the coding polynucleotide is adjacent to a dhfr gene), cycloheximide, cell 
densities, culture dishes, etc. A polynucleotide can be introduced into the cell by any 
effective method including, e.g., naked DNA, calcium phosphate precipitation, 
electroporation, injection, DEAE-Dextran mediated transfection, fiision with liposomes, 

15 association with agents which enhance its uptake into cells, viral transfection. A cell into 
which a polynucleotide of the present inveation has been introduced is a transformed host 
cell. The polynucleotide can be extrachromosomal or integrated into a chromosome(s) of the 
host cell. It can be stable or transient An expression vector is selected for its compatibility 
wifli the host cell. Host cells include, mammalian cells, e.g., COS, CVl, BHK, CHO, HeLa, 

20 LTIC, NIH 3T3, cardiac or heart cells, such as Wl (Wang et al.. In vitro Cell Dev., 27:63-74, 
1991), MC29, cardiac fibroblasts (e.g., Wang et al., Tiss Cell, 33:86-96, 2001), cardiac 
microvascular endothelial cells (e.g. JoUow et al.. Transplantation, 68:430-439, 1999), T/G 
HA-VSMC (CRL-1999), H9c2(2-1) (CRL-1446), P19 (CRH825) , CNS neural stem cells 
(e.g., U.S. Pat. No. 6,103,530), IMR-32, A172 (ATCC CRL.1620), T98G (ATCC CRL- 

25 1690), CCF-STTGl (ATCC CRL-171 8), DBTRG-05MG (ATCC CRL-2020), PFSK-1 

(ATCC CRL-2060), SK-N-AS and otiier SK cell lines (ATCC CRL-2137), CHP-212 (ATCC 
CRL-2273), RG2 (ATCC CRL-2433), HCN-2 (ATCC CRL-10742), U-87 MG and otiier U 
MG cell lines (ATCC HTB-14), D283 Med (ATCC HTB-185), PC12, Neuro-2a (ATCC 
CCL-ni), muscle cells lines, such as RD (CCL-136),.G-7, G-8, C2C12, estabUshed and 

30 primary brain, heart, or muscle ceUs, 0-402 (ATCC CBlL-1440), ACHN (ATCC CKH61 1), 
Vero (ATCC CCL^81), 786-0 (ATCC CRL.1932), 769-P (ATCC CRI^1933), CCD 1 103 
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KIDTr (ATCC CRL-2304), CCD 1 105 KIDTr (ATCC CRL.2305), Hs 835.T (ATCC CRI^ 
7569). Hs 926.T (ATCC CRL-7678), Caki-1 (ATCC HTB-46), Cald-2 (ATCC HTB-47), SW 
839 (ATCC HTB-49), LLC-MBC2 (ATCC CCL-7). BHK-21 (ATCC CCI^IO), MDBK, CV- 
1, (ATCC CRL-1573), KNRK (ATCC CRL-1569), NRK-49F (ATCC CRL-1570). A-704 

5 (ATCC HTB-45), and oflier established and piimaiy kidney lines, CNS neural stem cells 
(e.g., U.S. Pat No. 6,103,530), IMR.32, A172 (ATCC CRL-1620), T98G (ATCC CRI^ 
1690), CCF-STTGl (ATCC CRL-1718), DBTRG-05MG (ATCC CRL-2020), PFSK-1 
(ATCC CRL-2060), SK-N-AS and other SK cell lines (ATCC CRL-2137), CHP-212 (ATCC 
CRI^2273), RG2 (ATCC CRL-2433), HCN-2 (ATCC CRI^10742), U-87 MG and other U 

10 MG cell lines (ATCC HTB-14), D283 Med (ATCC HTB-1 85), PC12, Neuro-2a (ATCC 
CCL-131), and other established and primary brain cell lines. Hep G2 (ATCC NO. HB- 
8065), SK-HEP-1 (ATCC NO HTB-52), H2.35 (ATCC NO CRL-1995), CD-I (ATC NO 
CRI^2254), C3A (ATCC NO CRL-10741), FL83B (ATCC NO CRL-2390), WRL 68 
(ATCC NO CL-48), Hep 3B (ATCC NO HB-8064), insect cells, such as Sf9 (S. fiugipeda) 

15 and Drosophila, bactoia, such as E. coli. Streptococcus, bacillus, yeast, such as 

Sacharomyces, S. cerevisiae, fungal cells, plant cells, embryonic or adult stem cells (e.g., 
maniTnalian, such as mouse or human). 

Expression control sequences are similarly selected for host compatibiUty and a 
desired purpose, e.g., high copy number, high amounts, induction, aiiq)Ufication, controlled 

20 expression. Other sequences which can be employed include enhancers such as from SV40, 
CMV, RSV, mducible promoters, cell-type specific elements, or sequences which allow 
selective or ^ecific cell expression. Promoters that can be used to drive its expression, 
include, e.g., the endogenous promoter, MMTV, SV40, trp, lac, tac, or T7 promoters for 
bacterial hosts; or alpha factor, alcohol oxidase, or PGH promoters Smt yeast. RNA 

25 promoters can be used to produced RNA transcripts, such as T7 or SP6. See, e.g.. Melton et 
al.. Polynucleotide Res., 12(18):7035-7056, 1984; Dunn and Studier. J. Mol. Bio., 166:477- 
435, 1984; U.S. Pat. No. 5,891,636; Studier et al.. Gene Expression Technology, Methods in 
Enzymology. 85:60-89, 1987. In addition, as discussed above, translational signals (including 
in-frame insertions) can be included. 

30 When a polynucleotide is expressed as a h^erologous gene in a transfected (»11 lin^ 

the gene is introduced into a cell as described aibove, under effective conditions in which the 
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graie is expressed. The tenn •'heterologous" means tiiat the gene has been introduced into the 
cell line by Ibe **hand-of-man." Introduction of a gene into a cell line is discussed above. 
TTie transfected (or transformed) cell e^qnessing the gene can be lysed or the cell line can be 
used intact 

5 For expression and otiier purposes, a polynucleotide can contain codons found in a 

naturally-occurring gene, transcript, or cDNA, for example, e.g., as set forth in SEQ ID NOS 
1, 11, 16, 25, 38, 43, 45, 47, 49, 51, and/or 58, or it can contain degenerate codons coding for 
the same amino acid sequences. For instance, it maybe desirable to change the codons in the 
sequence to optimize the sequence for e^qnession in a desired host See, e.g., U.S. Pat Nos. 

10 5,567,600 and 5,567,862. 

A polypeptide according to the present invention can be recovered fiom natural 
sources, transformed host cells (culture medium or cells) according to the usual methods. 
Another approach is express the polypeptide recombmantiy with an affinity tag (Flag epitope, 
HA epitope, myc epitope, 6xHis, maltose binding protdn, chitinase, etc) and then purify by 

1 5 anti-tag antibody-conjugated affinity chromatography. 

The present invention also relates to polypeptides, e.g., an isolated human polypeptide 
comprising or having the amino add sequence set forth in SEQ ID NOS 1 , 11, 16, 25, 38, 43, 
45, 47, 49, 51, and/or 58, an isolated human polypeptide comprising an amino add sequence 
having 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or more amino add 

20 sequence identity to the amino add sequence set forth in SEQ ID NOS 1 , 1 1, 16, 25, 38, 43, 
45, 47, 49, 51, and/or 58, and optionally having one or more activities. Fragments can also 
used, e.g., to produce antibodies or other immune responses, as competitors to any activity. 
These fiagments can be referred to as bemg "spedfic for" polynucleotides. The latter phrase, 
as already defined, indicates that the peptides are characteristic of polynucleotides, and that 

25 the defined sequences are substantially absent from all other protein types. Such 

polypeptides can be of any size which is necessary to confer specificity, e.g., 5, 8, 10, 12, 15, 
20, etc. 

The present invention also relates to specific-binding partners. These include 
antibodies which are spedfic for polypeptides encoded by polynucleotides of the present 
30 invention, as well as other binding-partners which intoract witii polynucleotides and 

polypeptides of the present invention. Protein-protein interactions between polynucleotides 
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and other polyp^tides and binding partners can be identified using any suitable methods, 
e.g., protein binding assays (e.g., filtration assays, chromatography, etc.) , yeast two-hybiid 
system (Fields and Song, Nature, 340: 245-247, 1989), protein arrays, gel-shift assays, FRET 
(fluorescence resonance enragy transfer) assays, etc. Nucleic add interactions (e.g., protdn- 
5 DNA or protein-RNA) can be assessed using gel-shift assays, e.g., as carried out in U.S. Pat 
No. 6,333,407 and 5,789,538. 

Antibodies, e.g., polyclonal, monoclonal, recombinant, chimeric, humanized, single- 
chain. Fab, and fragments fliereof, can be prepared according to any desired method. See, 
also, screening recombinant immunoglobulin Ubraries (e.g., Orlandi et al., Proc. Natl. Acad. 
10 Sci., 86:3833-3837, 1989; Huse et al.. Science, 256:1275-1281, 1989); in vitro stimulation of 
lymphocyte populations; Winter and Milstdn, Nature, 349: 293-299, 1991. The antibodies 
can be IgM, IgG, subtypes, IgG2a, IgGl, etc. Antibodies, and immune responses, can also be 
generated by administering naked DNA See, e.g., U.S. Pat. Nos. 5,703,055; 5,589,466; 
5,580,859. Antibodies can be used fix>m any source, including, goat, rabbit, mouse, chidcen 
1 5 (e.g., IgY; see, Duan, WO/029444 for methods of making antibodies in avian hosts, and 

harvesting the antibodies from the eggs). An antibody spedfic for a polypeptide means that 
the antibody recognizes a defined sequaice of amino acids within or including the 
polypeptide. Oflier spedfic binding partners include, e.g., aptamers and PNA. Antibodies 
can be prepared against spedfic epitopes or domains of polynucleotides. 
20 Any form or type of antibody can be prepared and used. For example, antibodies can 

be humanized, e.g., where they are to be used therapeutically. Another form of an antibody 
fragment is a peptide coding fat a single conq)l«nentarity-detennining region (CDR). CDR 
peptides ("minimal recognition units") can be obtained by constructing genes encoding flie 
CDR of an antibody of interest. The term "antibody" as used herdn includes intact 
25 molecules as well as fragments thereof such as Fab, F(ab')2, and Fv which are enable of 

binding to an epitopic determinant. Such antibody fragments retain some abiUty to selectively 
bind with its antigen or receptor. The term "epitope" refers to an antigenic determinant on an 
antigen to whidi the paratope of an antibody binds. Epitopic determinants usually consist of 
diemicaUy active surfrice groupmgs of molecules such as amino adds or sugar side chains 
30 and usually have specific three dimensional structural diaracteristics, as well as specific 
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diaige diaracteristics. Antibodies can be prq>ared against specific ^itopes or polyp^tide 
domains. 

Antibodies which bind to polynucleotides polyp^tides of the preset invention can 
be prepared using an intact polypeptide or fragments containing small peptides of interest as 
the immunizing antigen. For example, it maybe desirable to produce antibodies that 
specifically bind to the N- or C-terminal domains of polynucleotides. The polypeptide or 
peptide used to immunize an animal which is derived firom translated cDNA or chemically 
synthesized which can be conjugated to a carrier protein, if desired. Anti-idiotype technology 
can also be used to produce invention monoclonal antibodies which mimic an epitope. 



Methods of detecting polypeptides 

Polypeptides coded for by polynucleotides of the present invoition can be detected, 
visualized, detemiined, quantitated, etc. according to any effective method, useful methods 
include, e.g., but are not limited to, inmamoassays, RIA (radioimmunassay), EUSA, 

15 (enzyme-linked-immunosorbent assay), immunoflourescence, flow cytometry, histology, 
electron microscopy, light microscopy, in situ assays, immunoprecipitation. Western blot. 

hnmunoassays may be carried in liquid or on biological support For instance, a 
sample (e.g., blood, stool, urine, cells, tissue,cerebral spinal fluid, body fluids, etc.) can be 
brought in contact with and immobilized onto a solid phase svqpport or carrier such as 

20 nitrocellulose, or other solid support that is capable of immobilizing cells, cell particles or 
soluble proteins. The support may fh&a. be washed with suitable buff^ followed by 
treatment with the detectably labeled polynucleotides spedfic antibody. The solid phase 
support can then be washed with a buffer a second time to remove unbound antibody. The 
amount of bound label on solid support may then be detected by conventional means. 

25 A "solid phase support or carrier" includes any support capable of binding an antigen, 

antibody, or other specific binding partner. Supports or carriers include glass, polystyrene, 
polypropylene, polyetiiylene, dextran, nylon, amylases, natural and modified celluloses, 
polyaciylamides, and magnetite. A support material can have any structural or physical 
configuration. Thus, the support configuration may be spherical, as in a bead, or cjdindrical, 

30 as in the inside surfece of a test tube, or Ihe external surface of a rod. Alternatively, the 
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surface maybe flat such as a sheet, test strip, etc. Preferred supports include polystyr«ie 
beads 

One of flie many ways in which gene peptide-spedfic antibody can be detectably 
labeled is by linking it to an enzyme and using it in an enzyme irmnunoassay (EIA). See, 

5 e.g., Voller, A., 'The Enzyme Linked Immunosorbent Assay (ELISA)," 1978, Diagnostic 
Horizons 2, 1-7, Microbiological Associates Quarterly Publication, Walkersville, Md.); 
Voller, A. et al., 1978, J. Clin. Pathol. 31, 507-520; Butler, J. K, 1981, Meth. Enzymol. 73, 
482-523; Maggio, E. (ed.), 1980, Enzyme Immunoassay, CRC Press, Boca Raton, Fla.. The 
enzyme which is bound to the antibody will react with an appropriate substrate, preferably a 

10 chromogenic substrate, in such a manner as to produce a chemical moiety that can be 

detected, for example, by spectrophotometric, fluorimetric or by visual means. Enzymes that 
can be used to detectably label the antibody mclude, but are not limited to, malate 
dehydrogenase, staphylococcal nuclease, delta-5-steroid isomerase, yeast alcohol 
dehydrogenase, .alpha.-glycerophosphate, dehydrogenase, triose phosphate isomerase, 

15 horseradish peroxidase, alkaline phosphatase, asparaginase, glucose oxidase, .beta.- 
galactosidase, ribonuclease, urease, catalase, glucose-6-phosphate dehydrogenase, 
glucoamylase and acetylcholinesterase. The detection can be accomplished by colorimetric 
methods that employ a chromogenic substrate for the enzyme. Detection may also be 
accomplished by visual comparison of the extent of enzymatic reaction of a substrate in 

20 comparison with similarly prepared standards. 

Detection may also be accompUshed using any of a variety of other immunoassays. 
For example, by radioactively labeling the antibodies or antibody fragments, it is possible to 
detect polynucleotides peptides through the use of a radioinmaunoassay (RIA). See, e.g., 
Weintraub, B., Principles of Radioimmunoassays, Seventh Training Course on Radioli^d 

25 Assay Techniques, The Endocrine Society, March, 1986. The radioactive isotope can be 
detected by such means as the use of a gamma counter or a scintillation counter or by 
autoradiography. 

It is also possible to label the antibody with a fluorescent compoimd. When the 
fluorescently labeled antibody is exposed to light of the proper wave length, its presence can 
30 then be detected due to fluorescence. Among flie most commonly used fluorescent labeling 
compounds are fluorescein isothiocyanate, rhodamine, phycoerythrin, phycocyanin. 



wo 03/085095 PCT/US03/09921 

-55- 

aUophyoocyanm,o-phthaldehydeandfluoresc^^ The antibody can also be detectably 
labeled using fluorescence emitting metals such as those in tihie lantibianide series. These 
metals can be attached to tihe antibody using such metal chelating groups as 
diethylenetriaminqpentacetic add (DTPA) or ethylenediaminetetraacetic acid (EDTA). 

5 The antibody also can be detectably labeled by coupling it to a chemiluminescent 

compound. The presence of the chemiluminesceait-tagged antibody is then deteraiined by 
detecting the presence of Imninescence that arises during the course of a chemical reaction. 
Examples of useful chemiluminescent labeling compounds are Ixmiinol, isoluminol, 
dieromatic acridinium estor, imidazole, acridinium salt and oxalate ester. 

1 0 Likewise, a bioluminescait compound may be used to label the antibody of the 

present invention. Bioluminescence is a type of chemiluminescence found in biological 
systems in which a catalytic protein increases the efficiency of the chemiluminescent 
reaction. The presence of a bioluminescent protein is determined by detecting the presence of 
Ixmiinescence. Important bioluminescent compounds for purposes of labeling are luciferin, 

1 5 luciferase and aequorin. 

Diagnostic 

The present invention also relates to methods and compositions for diagnosing a 
disorder, or detemiining susceptibility to a disorder, using polynucleotides, polypeptides, and 

20 specific-binding partners of the present invention to detect, assess, determine, etc., 

polynucleotides of the present invention. In such methods, the gene can serve as a marker 
for the disorder, e.g., where ttie gene, when mutant, is a durect cause of the disorder; where 
the gene is affected by another gene(s) which is directly responsible for the disorder, e.g., 
when the gene is part of the same signaling pathway as the directly responsible gene; and, 

25 where the gene is chromosomally linked to the gene(s) directly responsible for the disorder, 
and segregates with it. Many other situations are possible. To detect, assess, determine, etc., 
a probe specific for the gene can be employed as described above and below. Any method of 
detecting and/or assessing the gene can be used, includmg detecting expression of the gene 
using polynucleotides, antibodies, or other specific-binding partners. 

30 The present invention relates to methods of diagnosing a disorder assodated with a 

polynucleotide of the present invention, or determining a subject's susceptibility to such 
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disoidCT, comprismg, e.g., assessing the expression of polynucleotide of the present invention 
in a tissue sample comprising tissue or cells suspected of having the disorder. The phrase 
"diagnosing" indicates that it is determined wheflier die sample has the disorder. A 
"disorder" means, e.g., any abnormal condition as in a disease or malady. **Deterajining a 
5 subject's susceptibility to a disease or disorder" indicates that the subject is assessed for 
whether s/he is predisposed to get such a disease or disorder, where the predisposition is 
indicated by abnormal expression of the gene (e.g., gene mutation, gene expression pattern is 
not normal, etc.). Predisposition or susceptibiUty to a disease may result when a such disease 
is influenced by epigenetic, environmental, etc., fectors. Diagnosing includes prenatal 
10 screening where samples fiom the fetus or embryo (e.g., via amniocentesis or CV sampling) 
are analyzed for the expression of the gene. 

By the phrase "assessing expression of a gene or polynucleotide of the present 
invention," it is meant that Ihe functional status of the gene is evaluated. This inchides, but 
is not limited to, measuring expression levels of said gene, determining the genomic structure 
1 5 of said gene, determining the mRNA structure of transcripts fiom said gene, or measuring the 
expression levels of polypeptide coded for by said gene. Thus, the term "assessing 
expression" includes evaliiating the all aspects of the transcriptional and translation^ 
machinery of the gene. For instance, if a promoter defect causes, or is suspected of causing, 
the disorder, dien a sample can be evaluated (i.e., "assessed") by looking (e.g., sequencing or 
20 restriction moping) at the promoter sequence in the geie, by detecting transcription products 
(e.g., RNA), by detecting translation product (e.g., polypeptide). Any measure of whether the 
gene is fimctional can be used, including, polypeptide, polynucleotide, and functional assays 
for the gene's biological activity. 

In making the assessment, it can be useful to compare the results to a normal gene, 
25 e.g., a gene which is not associated widi the disorder. The nature of the comparison can be 
det«inined routinely, depending upon how the assessing is accomplished. If, for example, 
the mRNA levels of a sample is detected, then the mRNA levels of a normal can serve as a 
comparison, or a gene whidi is known not to be affected by the disorder. Methods of 
detecting mRNA are weU known, and discussed above, e.g., but not limited to. Northern blot 
30 analysis, polymerase chain reaction (PCR), reverse transcriptase PGR, RACE PGR, etc. 
Similarly, if polypeptide production is used to evaluate the gene, then tiie polypeptide in a 



wo 03/085095 PCT/US03/09921 

-57- 

noimal tissue sample can be used as a comparison, or, polypeptide ftom a diflferrait gene 
whose expression is known not to be affected by the disorder. These are only examples of 
how such a method could be carried out 

The present invention relates to methods of identifying a gmetic basis for a disease or 
5 disease-susceptibihty, comprising, e.g., determming the association of a disease or disease- 
susceptibility with a gene of the present invention. An association between a disease or 
disease-susceptibiUty and nucleotide sequence includes, e.g., establishing (or finding) a 
correlation (or relationship) between a DNA marker (e.g., gene, VNTR, polymorphism, EST, 
etc.) and a particular disease state. Once a relationship is identified, the DNA marker can be 
10 utilized in diagnostic tests and as a drug target. Any region of the gene can be used as a 
source of the DNA marker, exons, introns, intergraiic regioi^, etc. 

Human linkage msqfs can be constructed to estabUsh a relationship between a gene 
and a disease or condition. Typically, polymorphic molecular markers (e.g., STRP's, SNP's, 
RFLP's, VNTR's) are identified within flie region, linkage and map distance between the 
1 5 markers is then established, and then linkage is established between phenotype and the 
various individual molecular markers. Maps can be produced for an individual family, 
selected populations, patient populations, etc. In general, these methods involve identifying a 
marker associated with the disease (e.g., identifying a polymorphism in a family which is 
linked to the disease) and then analyzing the surromding DNA to identity the gene 
20 responsible for the phenotype. See, e.g., Kruglyak et al.. Am. J. Hum. Genet., 58, 1347-1363, 
1996; Matise et ai.., Nat. Genet., 6(4):384-90, 1994. 

Assessing the effects of ther^eutic and preventative interventions (e.g., 
administration of a drug, chemotherapy, radiation, etc.) is a major effort in drug discovery, 
clmical medicine, and pharmacogenomics. The evaluation of flierapeutic and preventative 
25 measures, whether experimental or akeady in clinical use, has broad applicability, e.g., in 
clinical trials, for monitoring the status of a patient, for analyzing and assessing animal 
models, and m any scenario mvolving disease treatment and prevention. Analyzing the 
expression profiles of polynucleotides of the present invention can be utilized as a parameter 
by which interventions are judged and measured. Treatment of a disorder can change the 
30 expression profile in some manner which is prognostic or indicative of the drug's effect on it 
Changes in the profile can indicate, e.g., drug toxicity, return to a normal level, etc. 
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Accordin^y, the present invCTtion also relates to methods of monitoring or assessing a 
therapeutic or preventative measure (e.g., chemotherapy, radiation, anti-neoplastic drugs, 
antibodies, etc.) in a subject having a disorder, or, susceptible to such a disorder, comprising, 
e.g,, detecting the expression levels of polynucleotides. A subject can be a cell-based assay 

5 system, non-human animal model, human patient, etc. Detecting can be accompUshed as 
described for the methods above and below. By '^therapeutic or preventative mtervention,'' it 
is meant, e.g., a drag adnmiistered to a patient, surgery, radiation, chemother^y, and other 
measures taken to prevent, treat, or diagnose a disorder. Expression can be assessed in any 
sample comprising any tissue or cell ^e, body fluid, etc., as discussed for other methods of 

1 0 the present invention. 



purposes, including, e.g., for diagnostic, therapeutic, and research purposes. Methods can 
involve deUvering or administering an active agent to the target tissue, cornprising, e.g., 
15 administering to a subject in need thereof, an effective amount of an active agent coupled to a 
binding partner specific for a polypeptide of the present invention, wherein said binding 
partner is effective to deliver said active agent specifically to the taret tissue. 



therapeutic, cytotoxic, cytostatic, chemotherapeutic, anti-neoplastic, anti-proliferative, anti- 
20 biotic, etc., agents. A chemotherapeutic agent can be, e.g., DNA-interactive agent, alkylatmg 
agent, antimetabolite, tubulin-interactive agent, hormonal agent, hydroxyurea, Cisplatin, 
Cyclophosphamide, Altretamine, Bleomycin, Dactinomydn, Doxorabicin, Etoposide, 
Teniposide, pacUtaxel, Cytoxan, 2-methoxycaAonylaminobenzmiidazole, Plicamycin, 
Methotrexate, Fluorouracil, Fluorodeoxyuridin, CB3717, Azacitidine, Floxuridme, 
25 Mercapyopurine, 6-Thioguanine, Pentostatin, Cytarabme, Fludarabine, etc. Agents can also 
be contrast agents usefiil in imaging technology, e.g.. X-ray, CT, CAT, MRI, \iltrasound, 
PET, SPECT, and scintographic. 

An active agent can be associated in any manner with a polynucleotides binding 
partner which is effective to achieve its delivery specifically to the target Specific deUvery 
30 or targeting indicates that the agent is provided to the target, without being substantially 
provided to other tissues. This is useful especially where an agent is toxic, and specific 



The present invention also relates to methods of using polynucleotides binding 
partners, such as antibodies, to deliver active agents to a tissue for a variety of different 



Any type of active agent can be used in combination with polynucleotides, including. 
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targeting enables the majority of the toxicity to be aimed at the target tissue, with as small as 
possible effect on other tissues in the body. The association of the active agent and the 
binding partner Q'couphng*) can be direct, e.g., through chemical bonds between flie binding 
partner and the agent, or, via a linking agent, or the association can be less dhect, e.g., where 
5 the active agent is in a liposome, or other carrier, and the binding partner is associated with 
the liposome surface. In such case, the binding partner can be oriented in such a way that it is 
able to bind to polypolypeptide on the cell surface. Methods for deUvery of DNA via a cell- 
surface recqptor is described, e.g., in U.S. Pat. No. 6,339,139. 

1 0 Identifymg agent methods 

The present invention also relates to methods of identifying agents, and the agents 
themselves, which modulate poljaiucleotides and polypeptides of the present invention. 
These agaits can be used to modulate flie biological activity of the polypeptide encoded for 
by the gene, or the gene, itself. Agents which regulate the gene or its product are useful in 

15 variety of different enviromnents, including as medicinal agents to treat or prevent disorders 
associated with polynucleotides of the present invention and as research reagents to modify 
the function of tissues and cell. 

Methods of identifying agents generally comprise steps in which an agent is placed in 
contact with the gene, its transcription product, its translation product, or other target, and 

20 then a detemiination is performed to assess wheflier the agent **modulates" flie target. The 
specific method utilized will depend upon a number effectors, including, e.g., the target (i.e., 
is it the gene or polypeptide encoded by it), the environment (e.g., in vitro or in vivo), the 
composition of the agent, etc. 

For modulating the expression of a gene or polynucleotide, a method can comprise, in 

25 any effective order, one or more of the following steps, e.g., contacting a polynucleotide or 
gene (e.g., in a cell population) with a test agent under conditions effective for said test agent 
to modulate its expression, and determining whether said test agent modulates it An agent 
can modulate expression of polynucleotides at any level, including transcription (e.g., by 
modulating the promoter), translation, and/or perdurance of the nucleic acid (e.g., 

30 degradation, stability, etc.) in the cell. 
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For modulating iiie biological activity of a polypeptide, a method can comprise, in 
any effective order, one or more of the following stq)s, e.g., contacting a polypeptide (e.g., in 
a cell, lysate, or isolated) with a test agent mider conditions effective for said test agent to 
modulate the biological activity of said polypeptide, and determining whether said test agent 

5 modulates said biological activity. 

Contacting polynucleotides with the test agent can be accomplished by any suitable 
method and/or means tiiat places the agent in a position to functionally control expression or 
biological activity. Functional control indicates that the agent can exert its phj^iologjcal 
effect on polynucleotides through whatever mechanism it works. The choice of tiie method 

10 and/or means can depend upon the nature of the agrait and the condition and type of 
environment in which tiie polynucleotides is presented, e.g., lysate, isolated, or in a cell 
population (such as, in vivo, in vitro, organ explants, etc.). For instance, if the cell population 
is an in vitro cell culture, the agent can be contacted with tiie cells by adding it direcdy into 
the culture medium. If the agent cannot dissolve readily in an aqueous medium, it can be 

15 incorporated into liposomes, or another lipophilic carrier, and then administered to the cell 
culture. Ck)ntact can also be facilitated by incorporation of agent with carriers and delivray 
molecules and complexes, by injection, by infusion, etc. 

Agents can be directed to, or targeted to, any part of the polypeptide which is 
effective for modulating it For example, agents, such as antibodies and small molecules, can 

20 be targeted to ceU-surfece, exposed, extracellular, ligand bmding, functional, etc., domains of 
the polypeptide. Agents can also be directed to intracellular regions and domains, e.g., 
regions where the polypeptide couples or interacts with intracellular or intramembrane 
binding partners. 

After the agent has been administered in such a way that it can gain access, it can be 
25 determined whether the test agent modulates expression or biological activity. Modulation 
can be of any type, quality, or quantity, e.g., increase, facilitate, enhance, up-regulate, 
stimulate, activate, amplify, augment, induce, decrease, down-regulate, diminish, lessen, 
reduce, etc. The modulatory quantity can also encompass any value, e.g., 1%, 5%, 10%, 
50%, 75%, l-fbld, 2-fold, 5-fold, 10-fold, 100-fold, etc. To modulate eqnression means, e.g., 
30 that the test agent has an effect on its expression, e.g., to effect the amount of transcription, to 
effect RNA splicing, to effect translation of the RNA into polypeptide, to effect KNA or 
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polypq>tide stability, to effect polyadenylation or other processing of the RNA, to effect post- 
tianscciptional or post-translational processing, etc. To modulate biological activity means, 
e.g., that a functional activity of the polypeptide is chan^ in comparison to its normal 
activity in tiie absence of the agait This effect includes, increase, decrease, block, inhibit, 
5 enhance, etc. 

A test agent can be of any molecular composition, e.g., chCToical compounds, 
biomolecules, such as polypq)tides, lipids, nucleic acids (e.g., antisense to a polynucleotide 
sequence selected fiom SEQ ID NOS 1, 11,16, 25, 38, 43, 45, 47, 49, 51, and/or 58), 
carbohydrates, antibodies, ribozymes, double-stranded RNA, aptamers, etc. For example, if a 

10 polypeptide to be modulated is a cell-surfece molecule, a test agent can be an antibody that 
specifically recognizes it and, e.g., causes the polypeptide to be mtemalized, leading to its 
down regulation on the sur&ce of the cell. Such an effect does not have to be permanent, 
but can reqiiire the presence of the antibody to continue the down-regulatory effect 
Antibodies can also be used to modulate the biological activity of a polypeptide in a lysate or 

15 other cell-free form. 

Additional cell-based test systems suitable for the analj^is of GPCR polypeptides are 
summarized in Marchese et al. (1999, Trends in Pharmacol. Sci. 20: 370-375) and comprise 
so-called "Ugand screening assays." For example in yeast cells the pheromon receptor can be 
replaced by a GPCR accordmg to the invention. The effect of test substances on the recqptor 

20 can be determined upon modulation of histidine synthesis, i.e. by growing in histidine-free 
medium. In addition using cells transfected with nucleic acids according to tiie invention it 
can be analyzed whether test substances mediate translocation of a detectable arrestins, for 
example of a arrestin-GFP-fusion protein. Moreover, it can be analyzed whether test 
substances mediate GPCR-mediated dispersion or aggregation of Xenopus laevis 

25 melanophores. Anotiier test system utilizes the vmiversal adapter G-protein G alphal6, which 
mobilizes Ca.sup.2+. Other screening test systems are described in Lemer et al., supra; 
W096/41169; U.S. Pat No. 5,482,835; WO99/06535; EP 0 939 902; W099/66326; 
W098/34948; EP 0 863 214; U.S. Pat No. 5,882,944 and U.S. Pat No. 5,891,641. 

30 Therapeutics 



Selective polynucleotides, polypeptides, and specific-binding partners thereto, can be 
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utUized in thoapeutic plications. UseM methods indude, but are not limited to, 
immunotherapy (e.g., \ising specific-binding partners to polypeptides), vaccination (e.g., 
using a selective polyp^tide or a naked DNA encoding such polypeptide), protein or 
polypeptide replacement therapy, gene thraapy (e.g., germ-line correction, antisense), etc. 

5 Various immunotherapeutic approaches can be used. For instance, unlabeled 

antibody that ^edfically recognizes a tissue-specific antigen can be used to stimulate flie 
body to destroy or attack a cancer or other diseased tissue, to cause down-regulation, to 
produce complement-mediated lysis, to inhibit cell growth, etc., of target cells which display 
the antigen, e.g., analogously to how c-erbB-2 antibodies are used to treat breast cancer. In 

10 addition, antibody can be labeled or conjugated to enhance its deletaious effect, e.g., with 
radionuclides and other energy emitting entitities, toxins, such as ridn, exotoxin A (ETA), 
and diphtheria, cytotoxic or cytostatic agents, immunomodulators, diemotherapeutic agents, 
etc. See, e.g., U.S. Pat No. 6,107,090. 

An antibody or other specific-binding partner can be conjugated to a second molecule, 

1 5 such as a cytotoxic agent, and used for targeting the second molecule to a tissue-antigen 

positive cell (Vitetta, E. S. et al., 1993, hmnunotoxin therapy, in DeVita, Jr., V. T. et al., eds. 
Cancer: Principles and Practice of Oncology, 4tii ed., J. B. Lippincott Co., Philadelphia, 
2624-2636). Examples of cytotoxic agents include, but are not Umited to, antimetabolites, 
alkylating agents, anthrai^lines, antibiotics, anti-mitotic agents, radioisotopes and 

20 chemotherapeutic agemts. Further examples of cytotoxic agents include, but are not limited to 
ridn, doxorubicin, daunorubidn, taxol, ethidium bromide, mitomydn, etoposide, tenoposide, 
vincristine, vinblastine, coldiicine, dihydroxy anthradn dione, actinomydn D, 1- 
dehydrotestosterone, diptheria toxin, Pseudomonas exotoxin (PE) A, PE40, abrin, elongation 
factor-2 and glucocorticoid. Techniques for conjugating therapeutic agents to antibodies are 

25 well. 

In addition to immunotherapy, polynucleotides and polypeptides can be used as 
targets for non-immunotherapeutic applications, e.g., using compounds which interfere with 
function, expression (e.g., antisense as a therapeutic agent), assembly, etc. RNA interference 
can be used in vitro and in vivo to silence polynucleotides when its expression contributes to 
30 a disease (but also for other purposes, e.g., 1» identify the gene's function to diange a 
developmental pathway of a cell, etc.). See, e.g.. Sharp and Zamore, Science, 287:2431- 
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2433, 2001; Grishok et al., SdencBy 287:2494, 2001. 

Delivery of flierapeutic agents can be achieved according to any effective method, 
including, liposomes, viruses, plasmid vectors, bactaial delivery systems, orally, 
systanically, etc. TTierapeutic agents of the present invention can be administ«%d in any 

5 form by any effective route, including, e.g., oral, parenteral, enteral, intr^eritoneal, toincal, 
transdennal (e.g., using any standard patch), intravenously, ophthahnic, nasally, local, non- 
oral, such as aerosal, inhalation, subcutaneous, intramuscular, buccal, sublingual, rectal, 
vaginal, intra-arterial, and intratiiecal, etc. They can be administered alone, or in 
combination with any ingredient(s), active or inactive. 

10 In addition to therapeutics, per se, the present invaition also relates to methods of 

treating a disease showing altered expression of a polynucleotide or polypeptide of the 
present invention, comprising, e.g., administering to a subject in need thereof a therapeutic 
agent which is effective for regulating expression of said polynucleotide or polypeptide 
which is effective in treating said disease. The term 'treating" is used conventionally, e.g., 

1 5 the management or care of a subject for the purpose of combating, alleviating, reducing, 
relieving, improving the condition o^ etc., of a disease or disorda:. By the phrase "altered 
expression," it is meant that the disease is associated with a mutation in the gene, or any 
modification to the gene (or corre^nding product) which affects its normal function. Thus, 
expression of polynucleotides refers to, e.g., transcription, translation, splicing, stability of 

20 the mKNA or protein product, activity of the g«ie product, differential raqjression, etc. 

Any agent which 'treats" the disease can be used. Such an agent can be one which 
regulates the expression of the polynucleotides. E3q)ression refers to the same acts akeady 
mentioned, e.g. tiranscription, tiranslation, splicing, stability of the mRNA or protein product, 
activity of the gene product, differential expression, etc. For instance, if flie condition was a 

25 result of a complete deficiency of the gene product, administration of gene product to a 
patient would be said to tixsat the disease and regulate the gene's expression. Many other 
possible situations are possible, e.g., where the gene is aberrantiy expressed, and the 
therapeutic agent regulates the aberrant expression by restoring its normal expression pattern. 



30 Antisense 
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Antisense poljoiucleotide (e.g,, RNA) can also be prepared fiom a polynucleotide 
according to flie present invention, preferably an anti-sense to a sequence of SEQ ID NOS 1, 
11, 16, 25, 38, 43, 45, 47, 49, 51, and/or 58. Antisense polynucleotide can be used in various 
ways, such as to regulate or modulate e?9ression of the polypeptides tiiey encode, e.g., inhibit 

5 their expression, for in situ hybridization, for therapeutic purposes, for making targeted 

mutations (in vivo, triplex, etc.) etc. For guidance on administering and designing anti-sense, 
see, e.g., U.S. Pat. Nos. 6,200,960, 6,200,807, 6,197,584, 6,190,869, 6,190,661, 6,187,587, ' 
6,168,950, 6^153,595, 6,150,162, 6,133,246, 6,117,847, 6,096,722, 6,087,343, 6,040,296, 
6,005,095, 5,998,383, 5,994,230, 5,891,725, 5,885,970, and 5,840,708. An antisense 

10 polynucleotides can be operably linked to an expression control sequence. A total length of 
about 35 bp can be used in cell culture witii cationic liposomes to fadhtate cellular uptake, 
but for in vivo use, preferably shorter oligonucleotides are administered, e.g. 25 nucleotides. 

Antisense polynucleotides can comprise modified, nonnaturally-occurring nucleotides 
and linkages between the nucleotides (e.g., modification of the phosphate-sugar backbone; 

15 methyl phosphonate, phosphorothioate, or phosphorodithioate linkages; and 2'-0-methyl 

ribose sugar units), e.g., to enhance in vivo or in vitro stability, to confer nuclease resistance, 
to modulate uptake, to modulate cellular distribution and compartmentalization, etc. Any 
effective nucleotide or modification can be used, including those already mentioned, as 
known in the art, etc., e.g., disclosed in U.S. Pat Nos. 6,133,438; 6,127,533; 6,124,445; 

20 6,121,437; 5,218,103 (e.g., nucleoside fliiophosphoramidites); 4,973,679; Sproat et al,, "2'-0- 
Methyloligoribonucleotides: synthesis and applications," Oligonucleotides and Analogs A 
Practical Approach, Eckstein (ed.), IRL Press, Oxford, 1991, 49-86; Wbairen et al., "2'0- 
Alkyl Oligoribonucleotides as Antisense Probes," Proc. Natl. Acad. Sci. USA, 1990, 87, 
7747-7751; Cotton et al., "2'-0-methyl, 2'-0-ethyl oligoribonucleotides and phosphorothioate 

25 oUgodeoxyribonucleotides as inhibitors of the in vitro U7 snRNP-dependent mRNA 
processing event," Nucl. Acids Res., 1991, 19,2629-2635. 



Arrays 

The present invention also relates to an ordered array of polynucleotide probes and 
30 specific-binding partners (e.g., antibodies) for detectmg the expression of polynucleotides or 
polypeptides in a sample, comprising, e.g., one or more polynucleotide probes or specific 
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binding partners associated with a solid support or in separate receptacles, wherein eadi 
probe is specific for polynucleotides, and the probes comprise a nucleotide sequence of SEQ 
ID NOS 1, 11, 16, 25, 38, 43, 45, 47, 49, 51, and/or 58 which is specific for said gene, a 
nucleotide sequence having sequence identity to SEQ ID NOS 1,11,16, 25, 38, 43, 45, 47, 

5 49, 5 1 , and/or 58 which is specific for said gene or polynucleotide, or complements thereto, 
or a specific-binding partner which is specific for polynucleotides. 

The phrase "ordered array" indicates that the probes (polynucleotides, binding- 
partners, polypeptides, etc.) are arranged in an identifiable or position-addressable pattern, 
e.g., such as the arrays disclosed in U.S. Pat Nos. 6,156,501, 6,077,673, 6,054,270, 

10 5,723,320, 5,700,637, WO0991971 1, WO00023803. The probes are associated with the 
solid support in any effective way. For instance, the probes can be bound to the solid 
support, either by polymerizing the probes on the substrate, or by attaching a probe to the 
substrate. Association can be, covalent, electrostatic, noncovalent, hydrophobic, hydrophilic, 
noncovalent, coordination, adsorbed, absorbed, polar, etc. When fibers or hollow filaments 

15 are utilized for the array, the probes can fill the hollow orifice, be absorbed into the solid 
filament, be attached to the surface of the orifice, etc. Probes can be of any effective size, 
sequence identity, composition, etc., as already discussed. 

Transgenic flnimals 

20 The present invention also relates to transgenic animals comprising polynucleotides 

genes, and homologs thereof (Methods of making transgenic animals, and associated 
recombinant technology, can be accomplished conveationally, e.g., as described in 
Transgenic Animal Technology, Pinkert et al., 2°^ Edition, Academic Press, 2002.) Such 
genes, as discussed in more detail below, include, but are not limited to, functionally- 

25 disrupted genes, mutated genes, ectopically or selectively-expressed genes, inducible or 

regulatable genes, etc. These transgenic animals can be produced according to any suitable 
technique or method, including homologous recombination, mutagenesis (e.g., ENU, 
Rathkolb et al., Ejq>. Physiol, 85(6):635-644, 2000), and tiie teti:acycline-regulated gene 
expression system (e.g., U.S. Pat. No. 6,242,667). The term "gene" as used herein includes 

30 any part of a gene, i.e., regulatory sequences, promoters, enhancers, exons, introns, coding 
sequences, etc. The polynucleotides nucleic acid present in the conshruct or transgene can b€ 
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naturaUy-occumng wild-type, polymorphic, or mutated. Wh«e the animal is a non-human 
animal, its homolog can be used instead. Transgenic animals can be susceptible to any of the 
dieases and disorders mentioned herem, e.g., as described more particularly under the 
descriptions of each gene. 

5 Along these lines, polynucleotides of the present invention can be used to create 

transgenic animals, e.g. a non-human animal, comprising at least one cell whose graiome 
comprises a functional disruption of a polynucleotide of the present invention, or a homolog 
thereof (e.g., a mouse homolog when a mouse is used). By the phrases "functional 
disruption*' or "functionally disrupted," it is meant that the gene does not express a 

10 biologicaUy-active product. It can be substantially defidait in at least one functional activity 
coded for by the gene. Expression of a polypeptide can be substantially absent, i.e., 
essentially undetectable amounts are made. However, polypqptide can also be made, but 
which is deficient in activity, e.g., where only an amino-terminal portion of the gene product 
is produced. 

1 5 The transgenic animal can comprise one or more cells. When substantially all its 

cells contain the engmeered gene, it can be referred to as a transgenic animal "whose genome 
comprises" the engineered gene. This indicates that the endogenous gene loci of the animal 
has been modified and substantially all cells contain such modification. 

Functional disruption of the gene can be accomplished in any effective way, 

20 including, e.g., introduction of a stop codon into any part of the coding sequence such that the 
resulting polypeptide is biologically inactive (e.g., because it lacks a catalytic domain, a 
ligand binding domain, etc.), introduction of a mutation into a promoter or other regulatory 
sequence that is effective to turn it ofl^ or reduce transcription of the gene, msertion of an 
exogenous sequence into the gene which inactivates it (e.g., which disrupts the production of 

25 a biologically-active polypeptide or which disrupts the promoter or other transcriptional 
machinery), deletion of sequences firom the gene (or homolog thereof), etc. Examples of 
transgenic animals having fimctionaUy disrupted genes are well known, e.g., as described in 
U.S. Pat. Nos. 6,239,326, 6,225,525, 6,207,878, 6,194,633, 6,187,992, 6,180,849, 6,177.610, 
6,100,445, 6.087,555, 6,080,910. 6,069,297, 6,060,642. 6.028,244, 6,013,858, 5,981,830, 

30 5,866,760, 5,859,314, 5,850,004, 5.817,912. 5.789,654, 5,777,195, and 5,569,824. A 
transgenic animal which comprises the functional disruption can also be referred to as a 
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•Tmock-ouf' animal, since the biological activity has been 'Tmoclced-ouf ' Knock-outs can be 
homozygous or heterozygous. 

For creating functionally disrupted genes, and other gene mutations, homologous 
recombination tedmology is of special interest since it allows specific regions of the genome 
5 to be targeted. Using homologous recombination methods, genes can be specifically- 
inactivated, specific mutations can be introduced, and exogenous sequences can be 
introduced at specific sitra. These mefliods are well known in the art, e.g., as described in the 
patents above. See, also, Robertson, i?^roi/«c., 44(2):238-245, 1991. Generally, the 
genetic engineering is periformed in an embryonic stem (ES) cell, or other pluripotent cell line 
1 0 (e.g., adult stem cells, EG cdls), and that genetically-modified cell (or nucleus) is used to 
create a whole organism. Nuclear transfer can be used in combination with homologous 
recombination technologies. 

For example, the polynucleotides locus can be disrupted in mouse ES cells using a 
positive-negative selection method (e.g., Mansour et al.. Nature^ 336:348-352, 1988). In this 
15 method, a targeting vector can be constructed which comprises a part of the gene to be 
targeted. A selectable marker, such as neomycin resistance genes, can be inserted into a 
polynucleotides exon present in the targeting vector, disrupting it When the vector 
recombines with the ES cell genome, it disrupts the function of the gene. The presence in the 
cell of the vector can be detomined by expression of neomycin resistance. See, e.g., U.S. 
20 PaL No. 6,239,326. Cells having at least one fimctionally disrupted gme can be used to 
make chimeric and gomline animals, e.g., animals having somatic and/or germ cells 
comprising the engineered gene. Homozygous knock-out animals can be obtained fiom 
breeding heterozygous knock-out animals. See, e.g., U.S. Pat. No. 6,225,525. 

The presmt invention also relates to non-human, transgenic animal whose goiome 
25 comprises recombinant polyaucleotides nucleic acid (and homologs thereof) operatively 
linked to an expression control sequence effective to express said coding sequence. Such a 
transgenic animal can also be referred to as a 'Imock-in" animal since an exogenous gene has 
been introduced, stably, into its genome. 

A recombinant nucleic acid refers to a polynucleotide which has been introduced into 
30 a target host cell and optionally modified, sudi as cells derived fiom animals, plants, bacteria, 
yeast, etc. A recombinant nucl^c acid mcludes completely synthetic nucleic add sequoices. 
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srani-synthetic nucleic add sequences, sequences derived fiom natural sources, and chimeras 
thereof. "Operable linkage" has the meaning used through the specification, ie., placed in a 
functional relationship with another nucleic add. When a gene is operably linked to an 
e}q>ression control sequence, as explained above, it indicates that the geaie (e.g., coding 
5 sequence) is joined to the expression control sequaice (e.g., promoter) in sudi a way that 
facilitates transcription and translation of the coding sequence. As described above, the 
phrase "genome" indicates that the genome of the cell has been modified. In this case, flie 
recombinant polynucleotide has been stably integrated into the genome of the animal. The 
nucleic add (e.g., a coding sequoice) in operable linkage with die expression control 
1 0 sequence can also be refeaed to as a construct or transgene. 

Any expression control sequraice can be \ised depending on the purpose. For 
instance, if selective expression is desired, then expression control sequences whidi limit its 
expression can be selected. These include, e.g., tissue or cell-specific promoters, introns, 
enhancers, etc. For various methods of cell and tissue-spedfic expression, see, e.g., U.S. Pat 
15 Nos. 6,215,040, 6,210,736, and 6,153,427. These also include the endogenous promoter, i.e., 
the coding sequence can be operably linked to its own promoter. Mdudble and regulatable 
promoters can also be utilized. 

The present invention also relates to a transgenic animal which contains a functionally 
disrupted and a transgene stably integrated mto the animal's genome. Such an animal can be 
20 constructed using combinations any ofthe above- and below-mentioned mediods. Such 
animals have any of the aforemaitioned uses, including permitting the knock-out of the 
normal gene and its replacement with a mutated gene. Sudi a transgene can be integrated at 
the endogenous gaie locus so that flie functional disruption and "knock-in" are carried out in 
the same step. 

25 In addition to the methods mentioned above, transgenic animals can be prepared 

according to known methods, including, e.g., by pronuclear injection of recombinant genes 
into pronucld of 1-cell embryos, incorporating an artifidal yeast chromosome into 
embryonic stem cells, g«ie targeting metiiods, embryonic stem cell methodology, cloning 
methods, nuclear traosf^ methods. See, also, e.g., U.S. Patent Nos. 4,736,866; 4,873,191; 

30 4,873,316; 5,082,779; 5,304,489; 5,174,986; 5,175,384; 5,175.385; 5,221,778; Gordon et al., 
Proc. Natl. Acad. Sd., 77:7380-7384, 1980; Pahniter et al.. Cell, 41:343-345, 1985; Pahniter 
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et al., Ann. Rev. Gaiet, 20:465-499, 1986; Askew et al., MoL CeU. Bio., 13:41 15-4124, 
1993; Games et al. Nature, 373:523-527, 1995; Valandus and Smitbies, Mol. CeU. Bio., 
1 1:1402-1408, 1991; Stacey et al., Mol. CeU. Bio., 14:1009-1016, 1994; Hasty et al.. Nature, 
350:243-246. 1995; Rubinstein et al., Nucl. Add Res., 21:2613-2617,1993; CibeUi et al., 
5 Sdence, 280: 1256-1258, 1998. For guidance on recombinase exdsion systans, see, e.g., 
U.S. Pat. Nos. 5,626,159, 5,527,695, and 5,434,066. See also, Orban, P.C., et al., "Tissue- 
and Site-Spedfic DNA Recombination in Transgenic Mice," Proc. Natl. Acad. Sd. USA, 
89:6861-6865 (1992); O'Gorman, S., et al., "Recombinase-Mediated Gene Activation and 
Site-Spedfic Integration in MammaUan Cells," Sdence, 251:1351-1355 (1991); Sauer, B., et 
10 al., "Cre-stimulated recombination at loxP-Containing DNA sequences placed into the 

mammaUan genome," Polynucleotides Research, 17(1): 147-161 (1989); Gagneten, S. et al. 
(1997) Nucl. Adds Res. 25:3326-3331; Xiao and Weaver (1997) Nucl. Adds Res. 25:2985- 
2991; Agah, R. et al. (1997) J. CUn. Invest 100:169-179; Barlow, C. et al. (1997) Nucl. 
Adds Res. 25:2543-2545; Araki, K. et al. (1997) Nucl. Adds Res. 25:868-872; Mortensen, 
15 R. N. et al. (1992) Mol. CeU. Biol. 12:2391-2395 (G418 escalation method); Lakhlani, P. P. 
et al. (1997) Proc. Natl. Acad. Sci. USA 94:9950-9955 ("hit and run"); Westphal and Leder 
(1997) Curr. Biol. 7:530-533 (transposon-generated "knock-out" and '*knock-in"); 
Templeton, N. S. et al. (1997) Gene Ther. 4:700-709 (methods for effideat gene targeting, 
aUowiag for a high frequency of homologous recombination events, e.g., without selectable 
20 markers); PCT Ihtemational PubUcation WO 93/22443 (functionaUy-disrupted). 

A polynucleotide according to thepresent invention can be introduced into any 
non-human animal, including a non-human mammal, mouse (Hogan et al.. Manipulating the 
Mouse Embrvo: A Laboratorv Manual. Cold Soring Harbo r Laboratorv. Cold Spring Harbor, 
New York, 1986), pig (Hammer et al.. Nature, 315:343-345, 1985), sheep (Hammer et al., 
25 Nature, 3 1 5 :343-345, 1 985), cattle, rat, or primate. See also, e.g.. Church, 1 987, Trends in 
Biotech. 5:13-19; Clark et al.. Trends in Biotech. 5:20-24, 1987); and DePamphiUs et al., 
BioTechniques, 6:662-680, 1988. Transgenic animals can be produced by the methods 
described in U.S. Pat No. 5,994,618, and utilized for any of the utUities described therein. 



30 Database 

The present iavention also relates to electronic forms of polynucleotides. 
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polypeptides, etc., of the present invention, including computer-readable medium (e.g., 
magnetic, optical, etc., stored in any suitable format, such as flat files or hierarchical files) 
which comprise such sequences, or fiBgments thereof e-commerce-related means, etc. 
Along these lines, the present invention relates to methods of retrieving gene sequences from 

5 a computer-readable mediunti, comprising, one or more of the following steps in any efiGsctive 
order, e.g., selecting a cell or gene expression profile, e.g., a profile that specifies that said 
gene is expressed in a particular (see the expression profiles described above), and retrieving 
said differentially expressed gene sequences, where the gene sequences consist of the genes 
represented by SEQ ID NOS 1, 11, 16, 25, 38, 43, 45, 47, 49, 51, and/or 58, or the 

1 0 polypeptides encoded thereby. 

A "gene expression profile" means the list of tissues, cells, etc., in which a defined 
gene is expressed (i.e, transcribed and/or translated). A "cell expression profile" means the 
genes which are expressed in the particular cell type. The profile can be a list of the tissues 
in which the gene is expressed, but can include additional information as well, including 

1 5 level of expression (e.g., a quantity as compared or normalized to a control gene), and 

information on temporal (e.g., at what point in the cell-cycle or developmental program) and 
spatial expression. By the phrase "selecting a gene or cell expression profile," it is meant that 
a user decides what type of gene or cell expression pattern he is interested in retrieving. Any 
pattern of expression preferences may be selected. The selecting can be performed by any 

20 effective method, hi general, "selecting" refers to the process m which a user forms a query 
ttiat is used to search a database of gene expression profiles. The step of retrieving involves 
searching for results in a database that correspond to the query set forth in the selecting step. 
Any suitable algorithm can be utilized to perform the search query, including algorithms that 
look for matches, or that perform optimization between query and data. The database is 

25 information that has been stored in an appropriate storage medium, having a suitable 

computer-readable format. Once results are retrieved, they can be displayed in any suitable 
format, such as HTML A query is formed by the user to retrieve the set of genes from the 
database having the desired property. Once the query is inputted into the system, a search 
algorithm is used to interrogate the database, and retrieve results. 

30 

Advertising, licensing, etc., methods 
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The present invention also relates to methods of advertising, licensing, selling, 
purchasing, brokering, etc., gmes, polynucleotides, spedfic-binding partnras, antibodies, etc., 
of the present invention. Methods can comprises, e.g., displaying a gene, polynucleotide, 
polypeptide, or antibody specific for a polypeptide in a printed or computer-readable medium 
5 (e.g., on the Web or Internet), accepting an offer to purdiase said gene, polypeptide, or 
antibody, etc. 

Other 

A polynucleotide, probe, polypeptide, antibody, specific-binding partner, etc., 

10 according to the present invaition can be isolated. The term "isolated" means tiiat tiie 
material is in a fimn in which it is not found in its original environment or in nature, e.g., 
more concenhated, more purified, separated firom component, etc. An isolated 
polynucleotide includes, e.g., a polynucleotide having the sequenced separated firom tiie 
chromosomal DNA found in a living animal, e.g., as the complete gene, a transcript, or a 

1 5 cDNA. This polynucleotide can be part of a vector or inserted into a chromosome (by 

specific gene-targeting or by random integration at a position otiier than its normal position) 
and still be isolated in that it is not in a form fliat is found in its natural environment. A 
polynucleotide, polypeptide, etc., of the presait invention can also be substantially purified. 
By substantially purified, it is meant tiiat polynucleotide or polypeptide is separated and is 

20 essentially fiee fiwm other polynucleotides or polypeptides, Le., the polynucleotide or 
polypeptide is the primary and active constituent. A polynucleotide can also be a 
recombinant molecule. By "recombinant," it is meant that the polynucleotide is an 
arrangement or form which does not occur in nature. For instance, a recombinant molecule 
comprising a promoter sequence would not encompass tiie natinrally-occurring gene, but 

25 would include tiie promoter operably linked to a coding sequence not associated with it in 
natui«, e.g., a reporter gene, or a truncation of the normal coding sequence. 

The term '*niarker" is used herein to indicate a means for detecting or labeling a 
target A marker can be a polynucleotide (usually referred to as a "probe"), polypeptide (e.g., 
an antibody conjugated to a detectable label), PNA, or any effective material. 
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The topic headings set forth above are meant as guidance where certain information 
can be fonnd in flie application, but are not intended to be the only sonice in the application 
yrbiae information on such topic can be found. Reference materials 

For other aspects of the polynucleotides, referaice is made to standard texibooks of 

5 molecular biology. Sp^, ^ g , Hames et al.. Polynucleotide Hybridization. IL Press, 1985; 
Davis et al., Basic Me&ods in Molecular Biologv« Elsevir Sciences Publishing, Inc., New 
York, 1986; Sambrook et al., TV/r»iprmlar rinninff CSH Press, 1989; Howe, Gene CloninR 
and Manipulation. Cambridge University Press, 1995; Ausubel et al., Currait Protocols in 
Molecular Biology. John Wiley & Sons, Inc., 1994-1998. 

10 Without fur&er elaboration, it is beUeved that one skilled in the art can, using the 

preceding description, utilize the present invention to its fullest exteat The following 
preferred specific embodiments are, therefore, to be construed as merely illustrative, and not 
limitative of the remainder of the disclosure in any way whatsoever. The entire disclosure of 
all applications, patents and publications, cited above and in the figures are hoeby 

15 incorporated by reference in their entirety, including U.S. Serial No. 10/094,092, filed Mareh 
11, 2002, U.S. Serial No. 10/112,372, filed April 1, 2002, U.S. Serial No. 60/382, 614, filed 
May 24, 2002, U.S. Serial No. 10/164,717, filed June 10, 2002, U.S. Serial No. 10/167,631, 
filed June 13, 2002, U.S. Serial No. 10/177,917, filed June 24, 2002, and U.S. Serial No. 
60/399,125, filed 30 July 2002, which are hereby incorporated by reference in their mtirety. 



20 





wo 03/085095 



PCT/US03/09921 



73 



TABLE 1 



Variant' 



Nucleotide diang6 



PtolSAla 



520G 



86iiisA 




Val60Leu 


178G>T 


Ala64Ser , 


190(3 >T 


Aig67Glu. 


..200G>A. 




2271* •> A 


Asp84Glu 


252C>A 


AlaSlPro 


241G>C 


Val92Met 


. 274G>A 


Tht95Met 


284C>T 


Val97ne 


289Q>A 


AlalOSVal 


3080 T 


Glyi04Ser 


310G>A 


Leul06GIn 


317T>A 


LeulQ6Lefi 


318G>A 


Aisl42His. 


425G>A 


ArglSlCys 


451C>T 


ArglSlArg 


453C > G 


Ilel55Thr 


464T>C ■ 


Argl60Tip 


4780T 


AigieSGltt 


488G>A 


Yall73del 




Vall74Ile 


520G>A 


537insC 




Flr{o230Leu 


689C>T 


Ptt)230Pro 


^90G>A 


Glii233Gln 


699G>A 


His26DPro 


779A>C. 


Ile264Ile 


792G>T 


Cy^SCys 


"8190 >T 


Lys278Glu 


832A>G 


Asn279Ser 


836A>G 


Asn279Lys 


8370A 


IIe287Met 


861C.>G 


Asp294Hls 


880Q>C 


PbeSOOPhe 


9«)C>T 


Thi>314Thr 


942A>G 


^tet316Ser 


9480T 
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♦ 

Allele 


Allele FmqiieiiOKi% 

' Wbfte lidiStdoals ' 
Po|HilaUoi» Wffli M Hafr 


ofcftRIP 
ProdncQon 


Wild tVDft 


53 


23 


-H+ 




10 


8 


■f 




<1 


1 


HA 




<1 


<1 


NA 






0 


NA 






0 


NA 


rn6#Diyr 


<1 


<1 


NA 




1 


3 


444 




<1 


0 


Wi 




<1 


1 


NA 




8 


8 






<1 


1 


NA 




.<1 


<1 


NA 


AlalOSVkl 


<1 


<1 


NA 


liA?U f VVUHll 


<1 


<1 


NA 




<1 


1 




AlVlSICVSS 


8 


25 


_ 


lle155Thr 


<1 


<1 


NA 


Aig16inkp§ 


7 


18 




Arg163Gln 




<1 


NA 


ll62S7Met 




0 


NA 




4 


13 




Ala299rnir 


<1 


<1 


NA 


!ns291l 


<1 


<1 




tns1701| 


<1 


<1 





^Several anonymous rariante have also been destaf bed. IndiKfing 
UuiOeUu. UulSSUu, ein233Gln. Cy8273Gys. PheSOOPhe, Th(314nir. and 
Sef316$en MG1-R Indicates melanocorfin-l receptor cAMR c^Wa 
adenosine monopho^hate: Mple plus sign, ^'gniflcant stiraidatlon (m» as 
wild type); slEigte plus sign, minimal stimulailon; NA. data notavaOabie; and 
minus sign, no stimulation (nonfunctional receptor), 

tPbsslbte assocbllon with biond/£dr tiali: 

iPresent In <1% of East/Southeast Asians. 

istrong associalion with red hair. talrsKln. and poor tanning abl% receM 
work also shows an ^sodation with cutaneous m^anoma and 
nonnudanontti sidn cancer. 

§PrQsentfn >70% of East/Southeasl Asian and NaUve Americans. 

$ns Incficaies Insertion; th^ dogle-nudeotide in^nion mutations 
juxKtui^J&ani^isldfi&lleiXB nonfundionlmi 



